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DISCUSSION  AND  SUMMARY 


The  third  meeting  of  the  Coordinating  Group  on  Modern  Control  Theory  was 
characterized  by  the  broad  range  of  subject  matter,  including  tank-helicopter 
fire  control  and  their  microprocessor  implementation,  tactical  missile  guidance 
and  control,  disturbance  accommodation  control,  prediction-estimation  design 
and  theory,  Martin  robustificatton  of  Kalman  filters  for  heavy  tailed  non- 
gaussian  disturbances  in  monopulse  radar  tracking  systems,  credibility  of 
computer  models,  time-to-go  algorithms  for  advanced  guidance  laws  and  stability 
analysis  for  power  systems. 

Considerable  interest  was  shown  in  the  experience  to  date  on  the  micro¬ 
processor  implementation  of  the  fire  control  algorithms  carried  out  by  members 
of  the  US  Army  Armament  Research  and  Development  Command.  The  work  of  Professor 
C.  D.  Johnson  on  disturbance  accommodation  control  and  applied  by  Dr.  William 
Kelly  of  the  US  Army  Missile  Command  and  Norman  Coleman  of  US  Army  Armament 
Research  and  Development  Command  clearly  have  broad  applications  to  current 
fire  control  activities  in  the  Army.  A  very  high  degree  of  interest  was 
demonstrated  by  the  concerns  of  Professor  Naim  Kheir  of  the  University  of 
Alabama  in  Huntsville  in  his  paper  on  credibility  of  computer  models.  The 
Technical  Committee  on  Model  Credibility  of  the  Society  of  Computer  Simulation 
have  proposed  adapting  several  measures  to  quantify  how  well  a  model  matches 
the  performance  of  reality  being  modeled.  The  Theil  inequality  coefficient 
(TIC)  was  recommended  for  missile  systems  validation.  Different  measures  of 
credibility  were  proposed  for  a  wide  range  of  problems. 

The  work  of  Professor  James  Lea thrum  of  Clemson  University  in  developing 
a  design  methodology  for  predictors  in  fire  control  systems  and  the  work  of 
Professor  Nan  Loh  on  generalized  observers  should  prove  to  be  significant 
contributions  to  Army  activities  in  improving  weapon  system  effectiveness. 

The  results  of  ALPHATECH  on  disturbed  recticle  sight  clearly  demonstrate  that 
first  order  lead  angle  prediction  is  ineffective  against  manuevering  targets 
and  that  the  disturbed  rectile  exhibited  a  significantly  worse  pointing 
performance  than  the  stabilized  sight. 

Joe  Craig  of  Adaptronics,  Inc.,  provided  promising  results  on  trainable 
adaptive  learning  network  (ALN)  guidance  lines  which  are  computationally 
simple,  uses  only  passive  observables  and  can  be  realized  using  current 
microprocessor  capabilities.  The  paper  by  Captain  Tom  Riggs  of  the  Air  Force 
Academy  demonstrated  that  the  time-to-go  algorithm  which  uses  range  over  range 
rate  for  estimating  time-to-go  severely  limited  missile  performance  and  that 
the  best  performing  algorithm  is  the  simple  closed  form  algorithm  that  forces 
the  command  missile  axial  acceleration  to  be  equal  to  the  actual  missile  axial 
acceleration. 

The  work  done  by  TASC  and  University  of  Pennsylvania  on  air  defense  fire 
control  (GLAADS)  concluded  that  Markov  chains  predictors  offer  noise  immunity 
over  AR  or  ARIMA  models  and  that  modern  control  techniques  in  design  of  AAA  fire 
control  provided  improvements  in  the  performance  and  stabilization  of  the 
weapon  system. 


V 


Work  currently  being  pursued  by  the  Naval  Weapon  Center  at  China  Lake 
on  the  application  of  robust  statistics  to  monopulse  radar  trackers  is 
exciting  and  has  the  potential  of  significant  improved  performance  in  the 
presence  of  heavy  tailed  non-gaussian  noise. 

Overall,  the  Chairman  was  pleased  by  the  free  exchange  of  information 
and  the  strong  interaction  of  participants.  The  Chairman  at  the  close  of 
the  meeting  announced  that  the  fourth  meeting  of  the  Coordinating  Group  on 
Modern  Control  Theory  will  be  held  during  the  week  of  25  October  1982  at 
Oakland  University.  Rochester,  Michigan. 


HERBERT  E.  COHEN 
Chairman 
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ANALYSIS  OF  TANK  FIRE  CONTROL  SYSTEMS 
BY  OPTIMAL  CONTROL  THEORY* 
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and 

Hal  Burke 
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INTRODUCTION 

The  primary  purpose  of  a  fire  control  system  (FCS)  is  to  track  and 
control  the  firing  at  a  moving  (and  possibly  maneuvering)  target. 
Often,  it  is  desired  to  include  a  gun  leading  mechanism  in  the  FCS 
in  order  to  compensate  for  the  fired  projectile's  time-of-f light 
toward  the  moving  target.  Recently,  a  research  effort  in  which 
several  fire  control  system  designs  were  analyzed,  was  reported 
in  (1).  The  control  designs,  all  of  which  include  (first  order) 
lead-angle  prediction  capability,  consisted  of  both  disturbed 
reticle  (DR) ,  and  stabilized  sight-director  systems  (SS) .  Two 
major  attributes  distinguish  a  DR  from  a  director-type  SS  system: 

1.  The  lead  predictor  is  included  in  the  DR's  visual  loop,  and 

2.  A  gun-to-recticle  crossfeed  path  exists  in  the  DR  system;  thus, 
coupling  the  gun  dynamics  into  the  visual  loop.  The  director 
system,  on  the  other  hand,  is  disengaged  from  both  predictor  and 
gun  dynamics . 

Since  the  human  operator  is  an  integral  part  of  the  FCS,  it  is  ^ 
necessary  that  any  performance  analysis  (tracking  and  gun-pointing 
in  particular)  be  approached  with  appropriate  gunner  modeling.  In 
this  paper,  the  performance  of  two  representatiye  generic  fire 
control  systems,  a  disturbed  reticle  and  a  stabilized  sight- 
director,  is  analyzed.  The  tool  through  which  these  systems  are 
analyzed  is  the  well  known  Optimal  Control  Model  (OCM) ,  (2) -(3) . 

The  OCM  is  a  nomative  performance-oriented  model  of  the  human ^ 
operator  engaged  in  a  control  task.  In  the  case  at  hand,  it  is  a 
gunner  performing  a  tracking  task. 

*  Th i s  work  was  supported  by  Army  Materiel  Systems  Analysis 
Activity  under  Contract  Number  DAAK30-80-C-0075 
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First,  the  dynamical  properties  of  the  two  FC  systems  are  dis¬ 
cussed.  A  brief  overview  of  the  gunner's  (optimal  control)  model 
follows,  and  the  performance  measures  employed  in  the  analysis 
are  introduced.  The  frequency-and  time-domain  modeling  approach, 
and  the  ensuing  modeling  results  are  then  discussed. 


DESCRIPTION  OF  THE  FIRE  CONTROL  SYSTEMS 

The  modeling  effort  is  initiated  by  selecting  representative 
generic  FC  systems  for  the  disturbed  reticle  and  the  stabilized 
sight-director  designs.  In  the  following,  the  basic  characteris¬ 
tics  of  these  systems  are  described.  We  begin  with  the  disturbed 
reticle  design. 

DISBURBED  RETICLE  (DR)  SYSTEM 

The  purpose  of  a  tank  fire  control  system  is  to  accomplish  the 
functions  of  tracking,  target-coordinate  estimation  and  predic¬ 
tion,  and  gun  pointing.  In  a  disturbed  reticle  design,  these 
functions  are  intermixed  in  a  complex  dynamic  interaction  of  the 
gunner,  sight,  and  gun  turret.  The  idea  of  a  disturbed  reticle 
fire  control  system  is  to  automate  the  computation  of  the  correct 
lead  for  a  constant  velocity  target. 

The  physical  behavior  the  the  DR  system  is  the  following.  The 
gunner,  seated  in  the  turret,  rotates  with  the  gun.  He  observes 
the  target  and  reticle  through  his  sight,  and  commands  the  rota¬ 
tion  of  the  gun  with  a  hand  control  to  keep  the  reticle  on  the 
target.  In  constant  velocity  tracking,  the  rate  of  rotation  of 
the  gun  is  proportional  to  the  displacement  of  the  hand  controls. 
Thus  the  hand  control  displacement  should  ideally  be  proportional 
to  the  target  angular  rate,  e,p,  if  the  gunner  is  successfully 
keeping  the  reticle  on  the  target. 

The  mechanical  hand  control  position  is  transferred  to  an  elec¬ 
trical  output,  u,  which  in  turn  acts  as  an  input  to  the  reticle 
and  turret  subsystem.  The  turret  servo^ should  ideally  cause  the 
gun  to  lead  the  target  by  an  angle  X^=Te^,  where  T  is  the  pro¬ 
jectile  time-of-f light .  This  will  insure  that  it  is  possible  to 
hit  a  target  moving  with  a  constant  angular  rate.  In  the  ideal 
case  of  steady-state  tracking  of  a  constant  angular  velocity 
target,  the  gunner  does  not  perceive  the  lead  but  simply  keeps  the 
reticle  on  the  target. 

The  disturbed  reticle  design  considered  is  mechanized  with  a  turret 
servo,  a  hand  control  filter,  a  reticle  (sight)  servo  and  a  cross¬ 
feed.  The  hardware  required  to  implement  the  disturbed  reticle 
concept  is  shown  in  Fig.  1.  The  reticle  is  projected  in  the 
gunner's  line  of  sight  by  a  movable  beam  splitter  and  mirror.  In 
addition,  light  from  a  light-emitting  diode  (LED)  is  reflected 


3 


off  this  mirror  into  an  optical  sensor  to  provide  a  position 
reference  signal  for  the  mirror  servo.  Since  the  lead  screw 
positions  the  photosensor  array,  the  (uncompensated)  input  to  the 
mirror  servo  is  proportional  to  the  lead  screw  angle,  9tc-  How¬ 
ever,  this  input  is  modified  by  combining  it  with  a  signal  from 
an  electronic  crossfeed.  The  resulting  signal  is  used  to  command 
the  mirror  servo  which  will  null  on  the  LED  and  position  the 
reticle  in  the  gunner's  line  of  sight.  The  lead  screw  tachometer 
output  and  the  command  serve  as  inputs  to  simultaneously  position 
the  gun  at  angle  6^. 

The  functional  block  diagram  representing  this  design  (DR)  is  shown 
in  Fig.  2.  The  command  u  is  applied  to  both  the  lead  screw  and 
gun  turret  servomechanisms.  The  lead  screw  servo  is  a  position 
servo  which  deflects  the  optical  sensor  an  amount  Tu  with  respect 
to  the  gun.  The  deflection  is  proportional  to  the  rotation 
of  the  lead  screw  and  is  measured  by  a  potentiometer.  A  tachometer 
is  used  for  stabilization  and  to  develop  a  lead  command  to  the  gun 
rate  servo. 


Fig.  1.  Physical 

Configuration , 
Disturbed  Reticle 
Design  (DR). 


Fig.  2.  Functional  Diagram, 
Disturbed  Reticle 
Design  (DR). 


The  optical  sensor  picks  up  the  deflected  light  angle  and  drives 
the  mirror  servo  until  the  LED  light  is  maximum.  The  mirror  and 
gun  servos  also  respond  to  the  crossfeed  compensation.  The  gun 
servo  is  rate  compensated  by  gyro  feedback.  It  is  commanded  by  u 
and  the  crossfeed  network. 
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The  crossfeed  network  is  central  to  the  three  individual  servos  in 
two  ways.  First,  it  is  an  integral  part  of  the  gun  servo  compen¬ 
sation  network,  and  second,  it  provides  crossfeeds  to  the  mirror 
servo  loop  to  slow  the  mirror  when  the  gun  servo  cannot  keep  up 
with  its  command.  This  is  seen  by  the  following  argument:  when 
the  gun  servo  error  is  large,  indicating  gun  lag,  the  crossfeed 
signal  is  maximum  and  slows  the  mirror  due  to  the  negative  sign. 
This  tends  to  diminish  the  nominal  lead  Tu.  A  simplified  state 
variable  model  (closed-loop)  of  the  DR  design  is  shown  in  Fig.  3. 


Fig.  3.  Disturbed  Reticle  Design,  State  Variable  Model. 


In  this  representation  we  include  the  gunner  in  the  control  loop. 
Note  that  the  (fast)  optical  sensor /mirror  motor  drive  dynamics, 
as  well  as  the  system's  inherent  nonlinearities  (1),  are  neglegted 
in  the  state  space  model  and  in  the  subsequent  analysis.  The 
pertinent  state  variables  are  defined  as  follows:  0,^,  =  target 
commanded  angle;  6^  =  gunner's  reticle  (sight)  angle;  Qe^Qt'^R 
tracking  error;  u  =  gunner's  control  signal;  Bg  =  gun  angle. 

Some  of  the  characteristics  of  this  configuration  are  summarized 
below. 

1.  The  dynamics  of  the  lead  servomachanism  are  parameterized  by 
its  time  constant,  Xj  =  .01  sec.  The  prediction  time  assumed  is 
T=2  sec. 

2.  We  explicitly  include  a  lead  servo  tachometer  gain  for  a 
sensitivity  analysis  option  as  illustrated  in  the  sequel. 
Nominally,  Kt,  =  1. 

3.  The  crossfeed  gain  is  taken  as  K  =  1.1,  following  (1). 

4.  We  assume  a  second-order  dynamics  for  the  gun  turret  as  shown 

in  a  natural 
=0.5. 

resolve  the 

disparity  between  the  fast  sight  dynamics  and  the  slow  gun  turret 
dynamics,  and  its  function  is  to  lower  the  lead-servo  bandwidth. 


in  Fig.  3.  The  turret  servo  gain  is  K.p=10,  resulting 
frequency  of  a)„=K„=10  rad/ sec  and  a  damping  ratio  of  ? 
5.  The  hand-control 


(lae-lead)  filter  is  designed  to 
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In  the  analysis  that  follows,  the  lag  time-constant  1  sec, 
and  the  lead  time-constant  Tg=0.1  sec.* 

The  gunner  responding  to  the  tracking  error  stimulus,  generates 
the  control  command  u  which  drives  the  gun  turret  and  the  reticle. 
Since  the  reticle  angle  6  is  subtracted  from  the  target  signal 
e,p,  the  human,  in  effect , ^controls  the  system  defined  by  the  trans¬ 
fer  function  Tjj^(s),  viz., 

OrCs)  a^s5+a^s‘^+a3s3+a2s2+aj^s+aQS 

=  s(s3+K^s2+K^2s+KpK,j2)(.,^s4-l)(TLS+l) 


where  the  a.  coefficients  are  function  of  the  system 
parameters  ^(5),  Substitution  of  the  appropriate  parameter  values 
yields  the  pole  locations  at  0,  -100,  -1,  -1.23,  -4.4  +  j8.4  and 
zero  locations  at  -.94,  -1.39,  -42.72,  -3.6+  j7.1.  A  Bode  plot  of 
T^p(s)  is  given  in  Fig.  4  (the  transfer  function  pertaining  to  the 
Ss  system,  Tgg(s),  is  also  shown,  and  will  be  discussed  in  the 
subsequent  section) . 


Fig.  4.  Controlled  Element  Dynamics  eR(s)/u(s). 

It  may  be  noted  that  the  DR  system  has  characteristics  that  close¬ 
ly  approximate  a  rate-system,  throughout  the  entire  frequency 
range.  This  is  not  surprising,  since  the  system's  poles  and  zeros 
(other  than  the  pole  at  the  origin)  which  lie  within  the  gunner's 
response  range,  essentially  cancel  each  other.  We  see,  therefore, 
that  the  direct  effect  of  introducing  the  hand-control  filter  is 
to  reduce  the  bandwidth  of  the  gunner-controlled  element  such  that 
it  approximates  a  K/s-like  system.  Such  a  rate-system  is  a  very 
desirable  (easy)  system  for  manual  control.  One  must  realize. 


This  is  an  actual  XM-1  Chrysler  design  (4). 
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however,  that  this  system  represents  only  the  visual-loop  dynamics. 
It  will  be  shown  in  the  sequel  that  the  gun-pointing  dynamics 
result  in  a  totally  unsatisfactory  pointing  performance. 

STABILIZED-SIGHT  DIRECTOR 

The  director  type  system  structure  is  not  as  complex  as  the  DR's, 
as  the  turret  dynamics  do  not  affect  the  gunner's  visual  loop. 

The  physical  components  of  a  generic  stabilized  sight-director 
fire  control  system  are  illustrated  in  Fig.  5.  Instead  of  the 
three  servos  that  comprise  the  disturbed  reticle  system  the 
stabilized  sight  employs  two.  The  system  uses  a  rate  integrating 
gyroscope  on  the  sight  to  provide  an  inertial  reference  in  the 
azimuth  axis.  The  azimuth  sight  gyro  enables  the  effect  of  the 
gun  turret  rotation  to  be  removed  from  the  dynamics  of  the  sight. 
This  inertially  stabilizes  the  sight  so  that  it  is  independent  of 
turret  motion.  As  a  consequence,  the  performance  of  the  sight 
loop  will  be  limited  only  by  the  ability  of  the  human  operator  to 
track  a  target  since  the  sight  servo  is  several  orders  of  magni¬ 
tude  faster  than  the  operator  response.  In  the  subsequent  analysis, 
the  fast  sight  servo  dynamics  are  neglected. 

As  indicated,  the  SS  system  consists  of  a  two  servo  loops,  one 
driving  the  sight  and  the  other  driving  the  turret.  These  two 
loops  are  each  independently  stabilized,  both  by  rate  integrating 
gyros.  The  command  from  the  sight  servo  loop  to  the  gun  servo 
subsystem  is  used  to  pass  target  position  information  which  is 
used  in  computing  the  correct  angle  to  point  the  gun.  The  coupling 
from  the  gun  servo  to  the  sight  servo  loops  locally  references  the 
sight  servo  to  the  gun  and  is  due  to  the  fact  that  the  sight  motor 
moves  relative  to  the  turret. 


Fig. 5.  Physical  Configuration,  Fig.  6.  Block  Diagram, 

Stabilized  Sight.  Stabilized  Sight. 
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A  (linear)  block  diagram  of  the  SS  system  is  given  in  Fig.  6.  As 
shown  in  the  figure  the  dynamic  coupling  of  the  turret  motion  to 
the  sight  director  is  immediately  removed  bythe  gyro.  The  gunner 
applies  a  signal  to  the  gun  sight  servo.  This  signal  is  combined 

with  sight  position  and  rate  feedback  and  used  to  drive  the  sight 

servo  motor.  In  this  figure  represents  the  inertial  angle  of 
the  reticle  measured  by  a  rate  integrating  gyro  and  0^  represents 
the  inertial  angle  in  which  the  gun  xs  poiting.  Note  there 

is  no  need  to  lead  the  reticle  since  the  sight  is  independent  of 
the  gun  angle;  thus  the  sight  angle  and  the  reticle  angle,  0^^, 
are  the  same. 

The  lead  angle  computed  by  the  system  corresponds  to  T0j^  which 

when  added  to  0p  is  the  correct  angle  the  gun  should  aim  to  hit 

a  constant  velocity  target.  This  signal  is  used  to  command  the 
gun  servo  lead.  Angular  position  information  from  the  i^ate 
integrating  gyros  on  the  sight  and  gun  are  commanded  as  the 
difference  between  the  sign  and  gun  angles.  The  lead  command  will 
null  itself  when  ~ 

Since  we  analyze  in  this  paper  the  stabilized  sight-director 
concept,  and  not  an  existing  system,  we  may  choose  arbitrary 
dynamics  for  the  visual  loop.  The  appropriate  choice  is  a  rate 
system  1/s,  so  that  any  comparison  between  the  SS  and  the  UK 
systems  will  be  meaningful.  A  simplified  block  diagram  of  such 
a  system  is  shown  in  Fig.  7. 


CONTROLLED 

ELEMENT 


V  I 
j 


TARGET 

driving 

SIGNAL 


TANK  TURRET  SERVO 


(K^=10)  r/s,  t,  =  0.5 


Fig.  7.  Stabilized  Sight-Director  System. 

^  the  gunner's  controlled  element  dynamics  are  simply 


Ts3(.)-i  . 

(see  Fig.  4),  and  the  gun  turret  servo  is  represented  by  a  second- 
order  underdamped  system  with  characteristics  similar  to  the  DR  s. 

Thusfar,  we  have  established  the  dynamics  of  the  two  generic 
FC  systems,  the  performance  of  which  we  wish  to  analyze .  In  the 
following  section  we  give  a  brief  overview  of  the  OCM,  the  main 
analytical  tool  which  is  employed  in  the  analysis. 
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OVERVIEW  OF  THE  OCM 


The  Optimal  Control  Model  (OCM)  is  well  documented  in  the  litera¬ 
ture  (2)-(3).  A  brief  overview  follows.  Fig.  8  shows  the  structure 
of  the  OCM.  The  sight  dynamics,  Tjjp^(s)  or  Tgg(s),  as  well  as  any 
other  pertinent  system  states  are  represented  in  state  space  form 

x(t)  =  Ax(t)+bu(t)+Ew(t)+Fz(t)  ^2) 

y(t)  =  Cx(t)+du(t) 

where  w(t)  is  a  white -Gaussian  disturbance  with  intensity  W  and 
z(t)  is  a  deterministic  target  trajectory.  The  displayed  informa¬ 
tion  y(t)  consists  of  tracking  error  QgCt)  and  error-rate  6g(t) , 
and  may  include  other  auxiliary  variables  and  their  rate  of^change. 


DISTURBANCES 
w (t) ,  * (t) 


Fig.  8.  Optimal  Control  Model  of  Gunner  Response. 

The  visual  assumption  in  the  OCM  is  that  the  human  perceives  a 
delayed  and  noisy  replica  of  y(t) ,  viz., 

yp(t)  =  y(t-T)  +  Vy(t-T)  (4) 

where  the  white-Gaussian  observation  noise  v  . (t)  of  the  i-th 
indicator  has  covariance  intensity  ^ 


In  these  equations  t is  the  pilot's  lumped  time-delay,  and  p  .  is 
the  observation  noise/signal  ratio  of  y^(t). 

Given  y  (t) ,  the  operator  estimates  the  system  state  x(t-T)  and 
predict!  the  current  state  x(t) .  He  then  develops  an  optimal 
control  strategy  by  minimizing  the  quadratic  cost  functional 

J(u)  =  E{y'(t)Qyy(t)+qj.u2(t)}  (6) 

where  Q  =  diag  (q  . )  are  the  relative  weightings  on  the  observa¬ 
tions  aXd  q  is  thi^^control-rate  weighting*.  Usually,  the  track¬ 
ing  error  wiighting  coefficient,  q^,  is  non-zero,  since  the 
objective  is  to  minimize  the  observed  error.  This  strategy  results 
in  the  optimal  control  gains,  L,  arising  from  the  pertinent  steady- 
state  Riccati  equation.  With  the  inclusion  of  a  motor  noise, 

V  (t) ,  in  the  pilot's  control,  the  optimal  control,  u(t) ,  obeys 
tRe  equation 


TjjU+u=Lx+v^(t)=u^(t)+v^(t)  (7) 

The  parameter  can  be  interpreted  as  a  "neuro-motor"  time  con¬ 
stant.  Usually,  Tj,  is  specified  and  q^  is  adjusted  accordingly, 
as  there  is  a  one-to-one  correspondence  between  the  two. 

The  motor  noise  v  (t)  is  assumed  to  be  white  and  Gaussian  with 
intensity  that  scales  with  the  covariance  of  u(t) , 

V^(t)  =  np^  cov  [u(t)]  .  (8) 

The  coefficient  represents  the  motor-noise/signal  ratio. 

Modeling  efforts  utilizing  the  OCM  can  be  approached  in  either  of 
two  ways : 

1.  steady- state  mode,  or 

2.  time-varying  (nonstationary)  mode. 

In  the  steady-state  mode  z(t)=0,  and  the  target  input  is  modeled 
by  a  stationary  colored  noise.  The  usual  practice  is  to  let  the 
disturbance  w(t)  be  a  white -Gaussian  noise,  and  to  augment  the 
system  with  the  dynamics  of  a  noise  shaping  filter  that  character¬ 
izes  the  driving  noise.  The  stationary  approach  is  most  suitable 
for  frequency  domain  analysis  of  human  response.  The  model  equa¬ 
tions  are  represented  in  the  frequency  domain,  so  that  various 
performance  measures  can  be  extracted.  The  model  outputs  include 
any  of  several  possible  transfer  functions  associated  with  the 
human,  with  the  vehicle,  or  with  the  overall  closed- loop  system. 

In  addition,  the  performance  scores  (tracking  error  RMS  values) 
may  be  predicted. 


*It  is  assumed  that  the  human  seeks  a  control  strategy  that  would 
minimize  control-rate  rather  than  control. 
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In  the  time-varying  case,  the  target's  profile  is  deterministic, 
i.e.,  z(t)#0  and  w(t)=0.  This  modeling  technique  usually  involves 
a  comparison  of  experimental  data  time-histories  (first-and  second- 
order  statistics)  with  the  model-predicted  ones.  It  is  required, 
therefore,  to  develop  the  process  mean  and  covariance  propagation 
equations  which  arise  from  the  non-random  component  z(t)  that 
drives  the  system.  These  equations,  as  well  as  the  frequency 
domain  equations ,  have  been  thoroughly  documented  in  the  litera¬ 
ture  ,  e.g.  (2)-(3).  Next,  we  analyze  the  control  performance  of 
the  two  gunner/FC  systems.  The  analysis  is  performed  in  both 
frequency-  and  time-domain  by  employing  the  appropriate  OCM 
techniques . 


FREQUENCY-DOMAIN  PERFORMANCE  ANALYSIS 

In  this  section  we  develop  the  frequency  domain  model  for  target 
tracking.  First  the  statistical  properties  of  the  maneuvering 
target  are  determined.  Next  we  obtain  the  overall  system  state 
space  representation,  which  includes  the  target,  the  sight,  and 
the  gun  dynamics.  Finally  we  define  frequency  domain  measures 
(metrics)  for  the  fire  control  system  performance  evaluation  and 
comparison. 

TARGET  EXCITATION  SIGNAL 

It  is  assumed  that  the  target  is  an  enemy  tank  that  maneuvers 
towards  the  gunner  in  an  avoidance  path.  It  is  also  assumed  that 
the  target  range  is  large  enough  to  be  taken  as  constant.  There¬ 
fore,  the  only  target-related  variable  of  interest  is  the  target's 
lateral  displacement,  or  equivalently,  the  target's  sight  angle, 
e,j,(t)  .  We  assume  that  the  target  angle,  e,j,(t)  ,  is  a  second-order 
Markov  process  driven  by  white  Gaussian  noise,  w(t) ,  with  an 
intensity  of  W.  This  assumption  is  practical  and  sufficient  for 
our  modeling  effort.  The  noise-coloring  process  is  assumed  to  be 
a  butterworth  filter  with  bandwidth  co  =0.5  rad/sec.  We  obtain, 
therefore,  the  differential  equation  ror  target  motion, 

9.j(t)  +  2ea)^e^(t)  +  e^(t)  =  w(t)  (9) 


where  c  =  1/J2  ,  and  the  resulting  target  signal  power  spectral 
density  is  ^ 


a)^W 

^T^“^  ""  u)'*+a)'+ 
n 


(10) 


The  appropriate  target  signal  power  value  is  selected  as 
E{0|(t)}  =  =  16  mrad2 


(11) 


n 


(12) 


i.e.,  0rp  =4  mrad.  This  selection  dictates  the  power  of 

6„  ’  ,  hnS^it  can  be  shown  that  for  a  butterworth  filter, 

T , rms ’ 

®T,rms  ~  “n^T.rms 

Therefore,  since  co  =  0.5  rad/ sec,  we  obtain  0,j,  j-„g=2  mrad/sec. 

If  we  further  assume  that  the  (~constant)  range;  k=2000  meters, 
we  obtain  an  equivalent  target  RMS  lateral  displacement  and 
velocity  (R0rr  and  R0„  .respectively)  of  8  meters  and 

4  meters/secoA5?^respecti^eTy.  These  displacement,  velocity, 
range,  and  bandwith  values  compare  well  with  the  Hardison  tracking 
test  simulations  reported  in  (4).  (Actually,  any  one  of  these 
four  parameters  is  determined  by  the  other  three.) 

MODEL  DEVELOPMENT 

The  OCM  structure  requires  a  state-space  form  of  the  controlled 
element / target  dynamics.  Table  1  defines  the  state  variables 
for  the  DR  and  SS  systems , 


TABLE  1.  STATE  VARIABLES  FOR  THE  DR  and  SS  SYSTEMS 


^1 

X2 

^3 

^4 

^5 

^6  ^7 

^8 

DR 

®T 

®T 

Lead 

Servo 

Angle 

HC 

Filter 

Signal 

“g 

0p  Input  Signal  to 
Turret  Servo  LPF 

®R 

SS 

®T 

0.J, 

®R 

»G 

One  may  notice  that  X/=0p  and  x,-— 0^  in  the  SS  system  are  unobser¬ 
vable  states.  They  are  included  so  that  predictions  can  be  made 
of  the  gun  motion. 

The  displayed  information  in  all  systems  is,  of  course,  the  tracking 
error,  0p.(t),  but  the  gunner  extracts  rate  information,  0j.(t)  ,  as 
well.  The  observation  set  is,  therefore,  y'=  ^g] •  Ihe  state 

equations  for  the  two  systems  can  now  be  easily  derived,  and  are 
given  elsewhere  (5).  The  system  parameter  values  are  listed  in 
a  previous  section.  For  the  gunner  we  select  the  OCM  nominal 
parameter  values:  p^=-25db;  p  =p  2^"20db;  Tj^=.15  sec;  t=.2  sec. 
The  cost  functional  used  is  y i-  ^ 

J(u)  =  E|02(t)  +  q^u2(t)|  (13) 

where  q  is  uniquely  determined  by  the  neuromotor  time  constant, 
■^N- 
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FREQUENCY  DOMAIN  MEASURES 

The  following  frequency  domain  metrics  are  used  in  the  performance 
analysis . 

1.  Gunner  Describing  Function,  H(s) (magnitude  and  phase)  —  This 
transfer  function  conceivably  can  be  calculated  from  field  (or 
simulation)  data.  It  is  defined,  relative  to  the  given  input 
disturbance,  w(s)*,  as  the  ratio  of  the  transfer  function  between 
input  u  and  noise  w  to  the  transfer  function  between  output  0p 
and  noise  w.  Thus,  assuming  negative  feedback 


H(s) 


u(s) /w(s) 
0g(s)/w(s) 


or  conceptually. 


H(s) 


=  u(s) ' 
■ 


(14) 


(15) 


2.  Circulatory  Transfer  Function,  0(s)  (from  0^  back  to  0^ 
signal)  —  This  is  a  rather  complex,  but  quite  important  transfer 
function.  For  single  indicator /display  system  it  is  equal  to  the 
single-axis  Y  Y  (human-vehicle  open-loop)  found  in  the  early  man- 
machine  literRtfire  (6) .  For  multiloop  systems  it  is  equivalent  to 
an  "outer-loop"  describing  function.  It  is  defined  as  follows. 

In  the  closed- loop  system  we  open  the  loop  at  a  given  indicator 
containing  position  information  and  rate  information  (0p,0p  in  the 
case  at  hand).  The  loop  transfer  function,  starting  at^the 
indicator  0p,  going  through  the  human,  the  vehicle  (slight/gun)  and 
ending  at  0p  is  0(s).  Quantities  such  as  gain  and  phase  margins, 
as  well  as  loop  bandwidth  are  obtained. 

In  the  context  of  this  stability  metric,  we  investigate  the 
robustness  of  the  disturbed  reticle  system  to  sight  tachometer, 

Kjj,  failures  (in  the  stabilized  sight-director  the  predictor  sub¬ 
system  is  out  of  the  visual  loop,  and,  therefore,  failures  of  the 
tachometer  have  no  effect  on  the  loop  stability) .  The  underlying 
assumption  in  this  analysis  is  that  the  gunner  cannot  adapt 
instantaneously  to  sudden  failures  in  the  sight  tachometer,  and 
his  describing  function,  H(s) ,  remains  tuned  to  the  old  system, 
for  which  Ktj=1-  We  then  compute  0(s)  which  is  comprised  now  of 
the  "old"  gunner  model  and  the  "new"  (degraded)  system. 

3.  RMS  values  (scores)  of  the  tracking  error,  0g(t)  -  This  metric 
is  a  measure  of  the  gunner's  tracking  performance.  We  do  not 
analyze  the  pointing  performance  of  the  FC  systems  in  the  frequency 


'We  recognize  that  w(t)  does  not  strictly  have  Laplace  transform. 
This  is  not  a  problem  since  it  "falls  out"  in  the  compution  of 
H(s). 
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domain,  since  it  involves  signals  at  different  times  (the  pointing 
error  is  defined  as  e,p(t+T) - 9g(t)  ,  where  T  is  the  projectile’s 
time-of-flight;  also  see  subsequent  section),  and  the  OCM  steady- 
state  analysis  can  treat  events  at  time  t  only.  The  pointing  per¬ 
formance  is  addressed  in  the  subsequent  time-domain  analysis. 


FREQUENCY  DOMAIN  MODELING  RESULTS 

The  modeling  results  are  summarized  in  Figures  9  through  11. 

These  results  are  discussed  below. 

1.  Gunner  Transfer  Function  —  Figure  9  shows  the  magnitude  and 
phase  of  H(s)  for  the  two  FC  systems.  The  difference  between  the 
DR  and  the  SS  are  relatively  minor,  as  the  visual  loop  dynamics 
in  the  two  systems  are  almost  identical  (see  Fig.  4).  What  is ^ 
displayed  in  Fig.  9  is  a  typical  human  operator  transfer  function 
controlling  a  rate-like  system. 


(a)  magnitude  PHASE 

Fig.  9.  Gunner  Transfer  Function,  H(s) . 


2.  Open-Loop  Transfer  Function  -  First  we  examine  the  loop 
(circulatory)  transfer  function,  0(s) ,  for  the  two  FCs  under 
nominal  conditions.  It  was  found  that  both  DR  and  SS  systems 
resulted  in  similar  0(s)  transfer  functions  in  the  nominal  case. 
For  that  reason,  only  the  DR's  circulatory  transfer  function  is 
shown  in  Fig.  10.  The  0(s)  function  for  the  SS  can  be  assumed  to 
be  the  same.  The  gain  and  phase  margins  are  comparable  (  6-7  dB 
and  40® ,  respectively) .  Next  we  assume  that  the  sight  tachometer 
gain  in  the  DR  system  is  abruptly  halved,  i.e.,  K„=.5.  The  re¬ 
sulting  open-loop  transfer  function  changes  dramatically  as 
evidenced  by  Fig.  11.  It  is  clear  that  if  the  gunner  fails  to 
adapt  to  the  new  conditions  (which  is  assumed  to  be  the  case 
under  normal  circumstances)  the  closed-loop  DR  system  becomes 
unstable.  In  the  stabilized  sight-director,  on  the  other  hpd, 
such  danger  is  nonexistent,  since  the  lead  prediction  loop  is 
disengaged  from  the  sight  dynamics.  The  only  effect  of  a 
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tachometer  failure  in  the  SS  system  is  a  degraded  pointing  per¬ 
formance,  while  all  stability  margins  are  maintained  at  their 
original  levels. 


Fig.  10. 


DR  Gunner-Vehicle  Open- 
Loop  Transfer  Function, 


Kr=1 


Fig.  11.  DR  Gunner- Vehicle 
Open-Loop  Transfer 
Function,  Kj^=0.5 


3.  RMS  of  Tracking  Error  (Scores)  —  The  OCM  predicted  scores  for 
the  DR  and  the  SS  systems,  given  the  hypotesized  target  dynamics, 
are  0.50  and  0.53  mrads ,  respectively.  These  scores  are  of  the 
same  order  as  those  reported  in  (4).  Thus,  the  difference  in  the 
tracking  performance  between  the  two  systems  is  insignificant. 
This  result  could  be  expected  since  both  systems  have  almost 
identical  visual  loop  dynamics . 


Next  we  analyze  the  FC  system's  performance  in  the  time-domain, 
where  more  significant  differences  are  uncovered. 


TIME-DOMAIN  PERFORMANCE  ANALYSIS 


TARGET  MANEUVER 

In  contrast  to  the  frequencey  domain  analysis,  the  target  tra¬ 
jectory  now  is  assumed  to  be  "deterministic".  In  order  to  expose 
the  differences  between  the  two  FCs  and  to  evaluate  their  lead 
prediction  capabilities,  it  was  required  to  design  a  target  maneu¬ 
ver  with  appreciable  acceleration  levels.  We  assume  a  target 
which  maneuvers  toward  the  gunner  at  an  approximate  range  of  R=750 
meters.  The  maneuver  begins  at  t=l  sec  and  ends  at  t=13  sec,  at 
which  point  the  target  continues  its  movement  with  a  constant 
velocity.  The  assvimed  trajectory  is  shown  in  Fig.  12. 
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Following  (1),  we  assume  a  generalized  Poisson-like  tangential  and 
radial  target  accelerations.  Since  the  range  assumed  is  effectively 
"constant”,  the  gunner  tracks  only  the  target's  lateral  motion. 
Therefore,  the  only  relevant  quantity  for  the  modeling  effort  is 
the  target  (resultant)  lateral  acceleration,  z(t).  This  accelera¬ 
tion  is  expressed  in  mrad/sec^  and  its  profile  is  shown  in  Fig.  13. 
A  more  detailed  description  of  this  target  intentional  maneuver  is 
given  in  (5) . 


Fig.  12.  Representative  Fig.  13.  Resultant  Lateral 

Target  Trajectory.  Acceleration. 

MODEL  DEVELOPMENT 

The  DR  and  the  SS  systems  are  of  course  unchanged,  and  we  use  the 
states  variables  which  were  defined  previously. ^  The  state  equa¬ 
tions  are  unchanged  except  for  the  target  dynamics  (first  and 
second  state  variables) ,  which  are  now 

Xi  =  X2 ,  ^2  =  z(t)  (16) 

The  z(t)  component  is  modeled  in  the  OCM's  Kalman  Filter  (Fig.  8) 
as  a  "pseudo"  white-noise  with  the  intensity 

W^(t)  =  2T^,z2(t)  ,  (17) 

where  x  =1  sec,  and  is  interpreted  as  the  correlation  time  of 
the  z(t^  process.  All  OCM  parameters  as  well  as  the  cost  func¬ 
tional  J(u)  (13)  remain  unchanged. 

TIME-DOMAIN  MEASURES 

Applying  the  target  driving  acceleration,  z(t),  the  ensemble  mean 
and  covariance  equations  of  the  closed- loop  system  are  propagated. 
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It  is  possible,  then,  to  obtain  time-histories  (first  and  second 
order  ensemble  statistics)  of  the  various  signals  in  the  control 
loop.  In  the  present  analysis  we  compare  the  following  time- 
histories,  obtained  for  the  two  FC  systems. 

1.  Tracking  Error,  9g(t)  —  Based  upon  the  similarity  of  the 
tracking  loop  dynamics,  and  the  frequency- domain  results,  no 
significant  differences  are  expected  in  tracking  performance. 

2.  "First-order"  gun  pointing  error,  0„p^(t)  —  This  quantity  is 
defined  as 


®TG^ (t) 


),j,(t)  +  Te,j,(t)  -  0g(t) 


(18) 


Usually,  one  is  interested  in  the  tracking  error,  0g(t) ,  statis¬ 
tics,  as  this  is  a  fundamental  measure  in  any  tracking  performance 
modeling  effort.  In  the  case  at  hand,  we  are  also  concerned  with 
the  evaluation  of  the  dynamic  properties  of  the  two  FC  systems. 

The  quantity  0,j,p^(t)  is  an  excellent  metric  for  such  evaluation  for 
the  following  reason:  If  we  replaced  the  turret-servo  second-order 
dynamics  with  a  unity  gain  (i.e.  ,  Kj,-»-“)  ,  and  if  we  assumed  that  the 
gunner  tracks  perfectly  (i.e.,^  ©^(tjHO),  then,  for  constant  velo¬ 
city  targets,  0g(t)  =  0„(t)+T0„(t) ,  simply  because  of  the  fact 
that  a  first-order  lead'^predictor  is  employed  in  each  of  the  FC 
systems.  Therefore,  by  evaluating  the  "first-order"  pointing- 
error,  0„p^(t),  we  eliminate  the  pointing  errors  which  are  intro¬ 
duced  by'^the  inability  of  the  first-order  predictor  to  account  for 
maneuvering  (accelerating)  targets.  The  signal  0,j.Q^(t)  merely 
reflects  the  effect  of  the  tracking  error  on  the  gun  output,  and, 
more  importantly  it  is  a  measure  of  the  dynamic  response  of  the 
FC  systems  at  hand. 

3.  Total  gun-pointing  error. 


S^nCt)  —  This  angle  is  defined  as 

iva 


0TG(t)  “  ^ 


0G(t) 


(19) 


and  it  reflects  the  total  pointing  error  between  the  target  at 
T(=2  sec)  ahead  and  the  present  gun  pointing  angle.  This  quantity 
is  a  measure  of  the  predictor's  performance  rather  than  the  FC 
system's.  In  this  sense,  the  statistical  analysis  of  0rpQ(t)  is  a 
prelude  to  any  future  research  effort  on  improving  lead-angle  pre¬ 
diction  mechanisms. 

MODELING  RESULTS 

The  modeling  results  are  summarized  in  Figs.  14  through  16. 

1.  Figure  14  shows  the  tracking  error  statistics  for  the  two  sys¬ 
tems.  Shown  are  (a)  ensemble  mean,  0g(t),  and  (b)  ensemble  standard 
deviation,  ag(t).  As  expected,  large  transients  are  evident  in 
the  vicinity^of  t— 2  sec  and  tci;8  sec,  as  the  largest  acceleration 
peaks  occur  at  these  times.  The  error  mean  and  variance  values 
are  comparable  to  the  results  reported  in  (4) .  No  significant 
differences  in  tracking  performance  are  detected  between  the  SS 
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and  the  DR  systems,  because  of  the  similarity  of  the  visual-loop 
dynamics.  Any  conclusions  from  these  results,  however,  may  be 
misleading  as  now  shown. 

2.  Figure  15  shows  the  "first  order"  pointing  error  results.  It 
is  clear  that  the  DR  system  (which  represents  the  actual  Chrysler 

an,  while  its  stan- 
st.  The  physical 
gun  response  is 

sluggish  and  exhibits  a  remarably  small  variability  by  filtering 
out  high  frequency  variations.  Since  the  tracking  error  (e„)  for 
the  DR  is  actually  the  smallest ,  it  is  clear  that  the  DR' s  unsatis 
factory  pointing  performance  is  due  to  its  sluggish  dynamics , 
induced  by  the  hand  control  filter.  It  is  unfortunate  that  good 
tracking  performance  has  been  achieved  at  the  expense  of  poor 
pointing  performance. 


Fie  14  Tracking  Error  Ensemble  Fig.  15  "First-Order"  Gun  Point 
Statistics  ing  Error  Statistics 

3.  Figure  16  shows' the  ensemble  statistics  of  the  actual  point¬ 
ing  angle  0rnp(t)  .  The  standard  deviation,  o,jiQ(t)  ,  is  not  shown 
since  it  is the  same  as  in  Fig.  15. 


Fig.  16  Total  Pointing  Error  Ensemble  Mean 


design)  exhibits  the  largest  pointing  error  me 
dard  deviation  ,  is  actually  the  smalle 
internretation  of  this  result  is  that  the  DR's 
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Two  observations  are  quite  obvious : 

1.  The  pointing  error  and  covariance  levels  are  extremely  large 
(over  20  mrad  at  one  point)  and  reflect  the  failure  of  the  first- 
order  lead  predictor  to  handle  accelerating  targets . 

2.  The  DR  exhibits  a  significantly  worse  pointing  performance 
than  the  SS.  The  larger  and  longer  persisting  errors  in  the  DR 
system  are  accorded  to  the  sluggish  gun  response,  which  results 
from  the  introduction  of  the  hand- control  filter. 


CONCLUSIONS 

This  paper  dociaments  partial  results  of  a  research  effort  to 
develop  analytical  techniques  for  evaluating  tank  fire  control 
systems,  and  to  apply  the  techniques  to  evaluate  existing  (dis¬ 
turbed  reticle)  and  proposed  (stabilized  sight-director)  fire 
control  system  mechanizations!  The  technique  through  which  the 
tv7o  FC  systems  are  analyzed  is  the  Optimal  Control  Model  (OCM) 
of  human  response.  This  modeling  approach  includes  the  gunner  in 
the  closed-loop,  i.e.,  it  considers  the  gunner- operator  as  an 
integral  part  of  the  FCS.  The  modeling  results  uncover  some  of 
the  weaknesses  of  the  disturbed  reticle  system.  In  particular, 
it  is  shown  that  under  certain  conditions  the  DR  becomes  dynamic¬ 
ally  unstable  and,  in  general,  its  gun-pointing  performance  is 
unsatisfactory.  In  addition,  it  is  shown  that  first-order  lead- 
angle  prediction  is  inappropriate  in  engagements  involving 
maneuvering  targets.  Thus,  a  higher  order  predictor  is  necessary. 
This,  however,  is  d  subject  of  future  research  efforts. 
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ABSTRACT 

A  class  of  general  structured  discrete-time  deterministic  observers  is 
developed.  The  one-step  predicting  or  Luenberger  observers  and  a  class  of 
current-update  observers  may  be  obtained  from  this  class  of  general  structured 
observers.  Interesting  relationships  and  important  properties  among  various 
full-order  observers  are  established.  Optimal  discrete-time  observers  are 
then  developed,  and  it  is  shown  that  these  optimal  observers  are  structually 
and  numerically  equivalent  to  various  forms  of  Kalman-Bucy  filters. 


1.  INTRODUCTION 

As  is  well  known,  the  development  and  applications  of  filters  and 
observers  for  stochastic  and  deterministic  systems  have  dominated  the 
literature  for  almost  two  decades,  [1-20].  Among  the  most  well-known 
techniques  for  estimating  the  state  of  a  system  are  the  stochastic  Kalman-Bucy 
filters  [1-4]  and  the  deterministic  Luenberger  observers  [5-8].  It  is  also 
well  known  [10,13,14]  that  an  equivalence  between  the  one-step  predicting 
Kalman-Bucy  filter  and  the  full-order  Luenberger  observer  can  be  established 
for  a  certain  choice  of  observer  gain.  However,  relationship  between  the 
various  forms  of  Kalman-Bucy  filters  and  other  possible  forms  of  deterministic 
observers  which  may  provide  additional  insight  into  observer  theory  are 
obscure. 

The  main  purpose  of  this  paper  is  two  fold.  The  first  objective  is  to 
present  a  class  of  general  structured  discrete-time  deterministic  observers. 
The  one-step  predicting  or  Luenberger  observers  [5-8]  and  a  class  of 
current-update  observers  may  be  obtained  from  this  class  of  general  structured 
observers.  Interesting  properties  among  various  forms  of  observers  are 
established  and  it  is  shown  that  the  one-step  predicting  or  Luenberger 
observers  and  the  current-update  observers  m.ay  be  related  via  a  set  of 
time-update  and  measurement-update  equations.  The  second  objective  is  to 
develop  a  set  of  optimal  gains  for  the  class  of  general  structured  observers. 
The  optimal  gains  are  then  used  to  establish  a  useful  relationship  between  the 
optimal  deterministic  observers  and  the  various  forms  of  Kalman-Bucy  filters. 
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The  o  rg3Tii  za  t  ion  of  the  paper  is  as  follows*  Section  2  of  the  paper 

presents  the  development  of  the  class  of  general  structured  discrete-time 
deterministic  observers*  Section  3  describes  the  relationship  among  various 
forms  of  full-order  discrete-time  deterministic  observers.  Optimal 

deterministic  observers  are  derived  in  Section  4  using  the  notion  of  gradient 
matrices  [21-24],  In  Section  5,  a  useful  equivalence  between  optimal 

full-order  observers  and  the  steady-state  or  time -invariant  Kalman-Bucy 
£xlti6r5  is  6stablish6d*  Throughout  tho  papoiTj  th©  notatioiij  zCi  |  will  b© 

used  to  denote  the  value  of  z  of  a  dynamical  equation  at  the  discrete  instant 

of  time  i  updated  by  utilizing  the  measurements  up  to  the  instant  of  time  j. 


2.  DISCRETE-TIME  DETERMINISTIC  OBSERVERS 
Consider  a  deterministic  dynamical  system  described  by 


x(k+l)  *  Ax(k)  +  Bu(k)  ,  x(0)  =  Xq. 

y(k)  =  Cx(k), 


where  x(k)  e  R",  u(k)  cR’^,  y(k)eR"',  x^eRn  is  arbitrary,  and  A,  B,  C  are 
constant  matrices  of  compatible  dimensions.  With  no  loss  of  generality,  it  is 
assumed  that  rank[C]  —  m*  In  general,  m<n  so  that  the  complete  state  x(kl 
cannot  be  uniquely  determined  from  the  output  relation  (lb)  alone.  however, 
subject  to  the  observability  or  detectability  of  (l),  an  estimate  of  x(k)  can 
be  generated  using  an  observer. 

In  this  section,  the  development  of  a  general  class  of  discrete  time 
deterministic  observers  is  investigated.  Hence  consider  a  general  class  of 
systems  having  the  following  structure: 


z(k+l|k+l)  =  Fz(k|k)  +  Gy(k+l)  +  Gy(k)  +  Hu(k), 
z(o|o)  =  Zq, 

x(k|k)  =  Pz(klk)  +  Vy(k), 


(2a) 

(2b) 

(2c) 


where  z(k|k)  eR^  with  q  being  >  the  dimension  to  be  determined,  Zg  is  an 
arbitrary  initial  condition,  F,  G,  G,  H,  P,  and  V  are  dimensionally  compatible 
constant  matrices  to  be  determined,  rank[P]  =  q,  and  x(k|k)eR'^  is  the  output 
of  (2). 

Definition  1:  System  (2)  is  said  to  be  a  general  structured  observer  for  (1) 
if 

lim  [x(k|k)  -  x(k)]  =  0. 


Define  estimation  error  vectors  as 
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(4a) 


e2(k|k)  -  zCklk)  ~  Tx(k), 
ex^kjk)  “  x(k[k)  ■  x(k),  (4b) 

where  T  is  to  be  determined;  and  whereby 


lim 

k-^ 

e2(k|k)  -  0, 

(5a) 

lim 

k->« 

e  (k|k)  =  0, 

A 

(5b) 

imply  that  2(k|k)  estimates  Tx(k)  and  x(k[k)  estimates  x(k). 

From  (l),  (2)  and  (4),  it  is  straightforward’  to  show  that 

e2(k’^llk’^l)  =  Fe2(k|k)  (FT•^GCA+GC-TA)x(k)+(H“TB•^GCB)u(k) ,  (6a) 

e  (0|0)  =  2(0)  -  Tx(0),  (6b) 

z 

e  (k[k)  =  Pe  (kik)  +  (PT+VC-I  )x(k).  (6c) 

X  Z  11 

The  following  theorem  governs  the  existence  of  a  general  structured  observer 
given  by  (2). 

Theorem  1 :  Suppose  (l)  is  completely  observable  and  completely  reachable. 

Then  (2)  is  a  general  structured  observer  for  (l)  if  and  only  if  the  following 
conditions  are  satisfied: 


(a) 

|X£[F]|<1,  i  I,...,  n, 

(7) 

(b) 

FT  GC  GCA  =  TA, 

(8) 

(c ) 

H  =  TB  -  GCB, 

(9) 

* 

(d) 

PT  +  VC  =  In, 

(10) 

where  A£[*]  denotes  the  i“th  eigenvalue  of  [*]* 


k 
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Proof :  See  Appendix  A. 


Remark  1:  If  (l)  is  completely  observable  and  stabilizable  but  not  completely 
reacbable>  then  (7)~(10)  in  Theorem  1  are  only  a  set  of  sufficient  conditions 
for  the  existence  of  observer  (2).  This  conclusion  follows  from  the  proof  of 
Theorem  1  where  condition  (b)  may  not  be  always  necessary  when  (l)  is  only 
stabilizable.  ^ 

In  the  sequel,  two  subclasses  of  the  general  structurea  observers  given  by 
(2)  are  described,  namely,  the  current-update  observer  and  the  one-step 
predicting^  or  Luenberger  observer  [5-8].  A  current -update  observer  takes 
the  form 


z(k+l  |k+l)  =  Fz(k|k)  +  Gy(,k+l)  +  Hu(k), 

(11a) 

N 

O 

II 

N 

O 

(11b) 

x(k|k)  =  PzCkjk)  +  Vy(k), 

(11c) 

where  (ll)  stems  from  (2)  with  G  =  0.  The  term  "current  update"  is  usea  to 
stress  the  fact  that  the  most  current  measurement  y(k+l)  is  utilized  for 
updating  the  state  z(k  | k)  of  the  dynamical  equation  Clla) .  The  following 

corollary  follows  from  Theorem  1. 

Corollary  1:  Suppose  (l)  is  completely  observable  and  reachable.  Then  111) 
is  a  current-update  observer  for  (l)  if  and  only  if 

(a)  Ui[F]  |<1,  i  *  1,...,  n. 

(12) 

(b)  FT  +  GCA  =  TA, 

(13) 

(c)  H  =  TB  -  GCB, 

(14) 

(d)  PT  +  VC  =  In- 

(15) 

A 

On  the  other  hand,  setting  G  =  0  in  (2)  yields  the  one-step 
Luenberger  observers,  having  the  form  [5-8], 

predicting  or 

z(k+l!k)  =  Fz(klk-l)  +  Gy(k)  +  Hulk), 

( 16a ) 

z(0  l-l)  =  Zq, 

(16b) 

x(k|k-l)  =  Pzik|k-l)  +  Vy(k), 

( I6c ) 

1 See  Remark  3. 
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where  the  notations  z(k+l|k)  and  x(k+l|k)are  used  to  emphasis  the  fact  that 
(16)  is  updated  by  a  one“step  previous  measurement  or  that  (l6a)  is  a  one-step 
prediction  algorithm,  and  Zq  is  an  arbitrary  initial  condition.  Similarly, 
Theorem  1  reduces  to  the  following  well  known  result  [5-8]  when  G  =  0. 

Corollary  2;  Suppose  (l)  is  completely  observable  and  reachable.  Then  (16) 
is  a  one-step  predicting  or  Luenberger  observer  for  (l)  if  and  only  if 


(a) 

1X£[F]|<1,  i  1,...,  n. 

(17) 

(b) 

FT  +  GC  =  TA, 

(18) 

(c  ) 

H  =  TB, 

(19) 

(d) 

PT  +  VC  =  In* 

(20) 

1 


Remark  2:  The  matrices  F  and  G  in  (16a)  may  be  shown  to  be  given  by 
F  =  TAP  +  Ui  -  (UiT  +  UoC)  f  T 

[cj  •  " 

G  =  TAV  +  U2  -  (U^T  +  U2C)  r  T 

L  c  J  ■ 

where  e  and  U2  €  are  arbitrary  constant  matrices  and  [  • 

denotes  the  psendo*inverse  [25,26]  of  [  •]• 

A 

3.  RELATIONSHIPS  AMONG  FULL-ORDER  DISCRETE-TIME 
DETERMINISTIC  OBSERVERS 

In  Section  2,  the  general  structured  observer  described  by  (2),  the 
current-update  observer  described  by  (ll)  and  the  one“step  predicting  or 
Luenberger  observer  described  by  (16)  are  all  of  order  q  where  q^n«  In  this 
section,  full-order  general  structured,  current-update  and  one-step  predicting 
or  Luenberger  observers  with  q=n  will  be  considered.  The  objective  is  to 
study  the  various  interesting  relationships  and  important  properties  of  these 
full-order  observers*  Furthermore,  the  relationship  between  the  full-order 
observers  developed  in  this  section  and  the  Kalman-Bucy  filter  will  be 
investigated  in  section  5,  where  it  will  be  shown  that  a  deterministic 
full-order  observer  can  become  a  Kalman-Bucy  filter  under  suitable 
conditions.  The  general  structured  observer  described  by  (2)  is  theretore  of 
fundamental  and  practical  importance. 

To  proceed,  let  T  ==  I^,  P  =  I^  and  V  =  0,  then  Theorem  1  yields  a 
full-order  general  structured  observer  of  the  form 
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(23a) 


x(k+l!k+l)  =  (A  -  GC  -  GCA)x(k|k)  +  Gy(k+1)  +  Gy(k) 

+  (B  -  GCB)u(k) 

~  Ax(k|k)  Bu(k)  G[y(k'*'l)  ~  CAx(k  jk)  “  CBu(k)] 

+  G[y(k)  -  Cx(klk)]. 
x(0|0)  =  Zq.  ^23b) 

Setting  G  ®  0  in  (23)  establishes  a  full-order  current-update  observer  (see 
also  [18-20]) 

x(k+llk+l)  *  (A  -  GCA)x(k|k)  +  Gy(k+l)  +  (B  -  GGB)u(k) 

*  Ax(klk)  +  Bu(k)  +  G[y(k+1)  -  CAx(klk)  -  CBu(k)],  (24a) 

x(o|o)  =  Zq  ;  (24b) 


while  setting  G  =  0  in  (23)  results  in  a  full-order  one-step,  predicting  or 
Luenberger  observer  [5“7] 

xCk+llk)  =  (A  -  GC)x(k|k-l)  +  Gy(k)  +  Bu(k) 

=  AxCk|k-*l)  +  Bu(k)  +  G[y(k)  *  Cx(klk“l)],  (25a) 

x( 0  |"l)  =  Zq9  (25b) 

where  the  notation  x(k+llk)  is  used  to  emphasize  that  (25)  is  a  one^step 
prediction  algorithm. 

Remark  3:  Equation  (2A)  has  the  form  of  a  Kalman-Bucy  true  filter  while  *(25) 
has  the  form  of  a  Kalman-Bucy  one-step  predicting  filter,  [1-4];  hence,  the 
Luenberger  observer  described  by  (25)  has  also  been  referred  to  as  a  one-step 
predicting  observer  in  this  paper.  Also,  it  is  observed  that  (24)  and  (25) 
can  respectively  be  obtained  from  Corollaries  1  and  2  with  l-I^, 

V=0.  Furthermore,  (25)  can  be  derived  from  Remark  1  with  ^ 

Now  the  estimation  errors  associated  with  (23),  (24)  and  (25)  may  be  shown 
to  be  given  by,  respectively, 


ej^(k+l  k+l)  (a  “  (5c  -  GCA)ex^R 


=  A  -{[G  I  G] 


}e^(k  k) 


(26a) 
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^26b) 


ex^olo)  =  Xq-Zq  ; 

ex^k+lIk+1)  *  (A  -  GCA)ex(klk),  C27a) 

Cx^olo)  =  Xq-Zq;  C27b) 

ex^k+ljk)  =  (A  -  GC)exCk|k-l),  (28a) 

ex(ol-l)  =  Xq-Zq.  (28b) 


Since  it  is  desired  that  the  estimation  errors  given  by  (26),  (27)  and  (28)  be 
made  to  approach  the  origin  asymptotically,  the  gain  matrices  G  and  G  must  be 
chosen  such  that  (A  -  GC  -  GCA),  (A  -  GCA),  and  (A  -  GC)  are  asymptotically 
stable.  It  follows  that  the  stabilities  of  the  three  matrices  are  closely 
related.  The  concept  of  observability  and  detectability  [27-29]  is  useful  and 
the  following  lemma  is  of  importance. 

Lemma  1 : 

(a-l)  The  pair  [A,  C]  is  completely  observable  if  and  only  if  the  pair 
[A,  (C*!a'C*)*]  is  completely  observable.  Furthermore,  if  the 

observability  index  of  [A,  C]  is  n^,  then  the  observability 
index  of  [A,  (C*jA*C*)*]  is  nQ-l. 

(a-2)  The  pair  [A,  C]  is  completely  detectable  if  and  only  if 
[A,  (C'jA'C*)*]  is  completely  detectable. 

(b-l)  If  A  is  nonsingular,  then  [A,  C]  is  completely  observable  if  and 
only  if  [A,  CA]  is  completely  observable. 

(b-2)  If  A  is  singular  and  the  pair  [A,  C]  is  completely  observable, 

then  the  pair  [A,  CA]  is  completely  detectable. 

(b-3)  If  the  pair  [A,C]  is  completely  detectable,  then  the  pair  [A,CA] 

is  completely  detectable.  i 

Hence  it  is  clear  from  Lemma  1  that  depending  ^on  the  observability  or 
detectability  of  the  pair  [A,C],  constant  matrices  G  and  G  can  be  found  such 
chat  !Xi(A-GC-GCA)  1<1,  IXi(A-GCA)|<l  and  |  X^(A-GC)|  <  1 ,  for  ail 

i— 1 ,2,...,n. 

A  further  useful  relationship  between  the  full-order  current-update 
observer  (24)  and  the  one-step  predicting  or  Luenberger  observer  (25)  may  be 

established  via  a  pair  of  well  known  time-update  and  measurement-update 
equations  [3].  Using  the  dynamics  of  system  (l),  a  time-update  or  prediction 
algorithm  may  be  defined  as 


x(k+llk)  =  Ax(klk)  +  Bu(k), 


(29a) 
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x(Oi  O)  =  z 


(29b) 


o> 


where  x(k+l]k)  is  a  one“step  predicted  estimate.  With  this  definition,  the 
current“update  observer  (24)  can  be  written  as 


x(k+llk+l)  *  x(k+l|k)  +  G[y(k+l)  “  Cx(k+l|k)], 


(30a) 


x(0|-l)  =  z^ 


(30b) 


where  (30)  is  a  measurement-update  equation  [3].  Substituting  (30)  into  (29) 
yields 


x(k+lik)  =  Ax(klk-l)  +  Bu(k)  +  AG[y(k)  -  Cx(k|k-l)], 


(31a) 


x(0|-l)  =  , 


(31b) 


where  (3l)  has  the  structure  of  the ^one-step  predicting  or  Luenberger  observer 
(25).  It  is  clear  that  if  the  gain  G  of  (25)  is  chosen  as 


G  ~  AG, 


(32) 


then  (25)  and  (3l)  describe  the  same  estimate,  i.e.. 


x(k  |k-l)  =  x(k  Ik-l) . 

~  2 

Maintaining  G  =  AG,  it  can  be  shown  that 

[A  —  GCA]  *  X^ [A  ~  GC],  i“l,...,  n. 


(33) 


(34) 


It  then  follows  that  the  stabilities  and  the  convergence  properties  of  the 
estimation  errors  of  the  full-order  current— update  and  one— step  predicting  or 
Luenberger  observers  are  equivalent.  It  is  further  remarked  that  the 
relationship  G  *  AG  also  turns  out  to  be  a  natural  consequence  of  a  set  of 
optimal  gains  for  G  and  G  as  will  be  pointed  out  in  Section  5. 


Observe  that 


X.  [MN] 
1 


X.  [NM],  where  M  and  N  are  square  matrices, 
i 
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4.  OPTIMAL  DETEEMINISTIC  DISCRETE-TIME  OBSERVERS 

Constructions  of  optimal  observer  gains  for  the  various  full-order 
observers  is  given  in  this  section.  These  optimal  gains  will  be  used  in  the 
next  section  to  obtain  a  set  of  optimal  deterministic  observers  which  is 
structurally  and  numerically  equivalent  to  a  corresponding  set  of  Kalman-Bucy 
filters.  To  achieve  these  objectives,  an  appropiate  performance  measure  will 
be  chosen. 


Consider  the  estimation  error  equation  given  by  (26)  for  the  general 
structured  observer,  and  consider  the  performance  measure 

e*(klk)We  (k  !k) 

1  k=0  ^ 

=  ?  e'(o|o)(A  -  GC  -  GCA)'^W(A  -  GC  -  GCA)*'e  (0  jO) 

k=0  ^ 

=  e^(o|o)Xex(o|o),  (35) 


where  W  is  an  nxn  symmetric  positive-definite  weighting  matrix,  and  X  is  an 
nxn  symmetric  matrix  given  by 

X  =  ?  (A  -  GC  -  GCA)'1^W(A  -  GC  -  GCA).  (36) 

k=0 


It  follows  from  Lemma  1  that  if  [A,  C]  is  completely  observable  or  completely 
detectable,  then  [A,  ( C* | A* C* )  *  ]  is  completely^  observable  or  completely 
detectable,  respectively,  so  that  there  exist  G  and  G  matrices  such  that 
(A  -  GC  -  GCA)  is  asymptotically  stable.  The  asymptotic  stability  ol 
(A  -  GC  -  GCA)  ensures  that  X  given  by  (36)  exists  and  therefore  is 

finite  for  finite  e35(oIo).  Furthermore,  since  W  is  positive-definite,  it 
follows  that  X  is  positive-definite  as  may  be  seen  from 

OO  —  -w 

X=W+  ^  (A  -  GC  -  GCA)'kw(A  -  GC  -  GCA)k.  (37) 

k=l 


Also  it  can  easily  be  shown  that  X  satisfies 


X  =(A  -  GC  -  GCA)*X(A  -  GC  -  GCA)  +  W.  (38) 


Now  since  given  by  (35)  is  explicitly  dependent  on  the  generally 

unknown  initial  estimation  error  ex(o|o),  minimization  of  will  giv^e 
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rise  to  optimal  gains  which  are  dependent  on  the  unknown  initial  condition 
ex^olo).  This  difficulty  may  be  alleviated  by  assuming  that  exl0|0;  is  a 
random  vector  with  arbitrary  constant  covariance  given  by 


E[e  (olo)e’(olo)]  =  Y,  <39) 

X  X 

where  Y  is  an  nxn  positive-semidef inite  matrix.  An  alternate  performance 
measure  is  then  given  by 


J2  =  trlXY], 


(40) 


where  tr[-]  denotes  the  trace  of  [‘J.  To  accomplish  the  development  of 
general  structured  observers  which  are  structurally  and  numerically  equivalent 
to  the  Kalman-Bucy  filters,  let  the  arbitrary  covariance  matrix  Y  be  taken  as 

Y  =  Q  +  GR^G'  +  GR2G'-GN£  -NjG' -•GN2-h2‘^' > 

where  Q  is  an  nxn  positive-semidef  inite  matrix,  and  R2 

positive-definite  matrices  and  %  and  N2  are  nxm  matrices,  to  be  assigned 
such  that  QqQq  ^  positive-semidef  inite  . 

The  constrained  optimization  problem  posed  by  (38)  and  (40)  may  be 
reformulated  as  an  unconstrained  optimization  problem  by  minimizing,  wi 

respect  to  G,  G,  X  and  I,  the  Lagrangian 

J  =  tr{XY+((A  -  GC  -  GCA)'X(A  -  GC  -  GCA)  +  W  -  X]  T  ')  (42) 


where  X  is  an  nxn  Lagrange  multiplier  matrix. 

Using  the  notion  of  gradient  matrices  [21-24],  the  necessary  conditions 
for  an  extremum  are  given  by 


3G 


=  0 


> 


_3J 

9X 


0 


(43) 


The  following  lemma  will  be  used 
a  symmetric  matrix  and  the  associated 


to  evaluate  the  gradient  with  respect  to 
extremum  conditions  in  (43). 


Lemma  2 :  Consider 
f  =  tr[A2] 


(44) 
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where  A  is  an  nxn  arbitrary  matrix  and  Z  is  an  nxn  symmetric  matrix.  Then  [24] 


^  =  A  +  A  -  diag  [A]  ,  (45) 

where  diag  [A]  =  diag  {  aii>  a22»  •••>  ^nn ^  »  with  a^^  being  the  diagonal 

3  f 

elements  of  A.  Furthermore,  if  A  is  also  symmetric,  then  *  0  if 

and  only  if  A=0.  ^ 

Assuming  that  X  and  Z  are  symmetric,  evaluation  of  the  gradient  matrices 


in  (43)  yields 

0  =  2X[G(Ci:c'+Ri)  -  AEC  +  GCAEC'-Ni]  (46) 

0*  2X[g(cai:a'c+R2)  -asa'c'  +  GCIA'C'-N2]  (^7) 

0  =  Y  +  (A-GC-GCA)E(A-GC-GCA)’-2,  (48) 

0=  (A-GC-GCA) 'X(A-GC-GCA)  +  W-X,  (49) 


where  Lemma  2  has  been  applied  to  obtain  (48)  and  (49). 

Rema rk  4 ;  It  is  noted  that  in  the  literature,  such  as  [16,  17,  21],  little 
attempt  has  been  made  to  explain  the  evaluation  of  gradients  with  respect  to 
symmetric  matrices,  although,  in  general,  correct  results  are  given.  k 

The  above  results  may  be  summarized  in  the  form  of  a  theorem  Isee  also 
Lemma  l)  as  follows: 

Theorem  2:  Suppose  (l)  is  completely  observable  or  completely  detectable. 

Let  Q,  Ri,  R2>  *^1  and  N2  be  the  weighting  matrices  satisfying  the 
conditions  specified  for  (41).  Then  the  optimal  full-order  general  structured 


observer  for  (l)  in  the  sense  of  minimizing  the  performance  measure 
J  =  tr[X(Q  +  GRj^G'  +  GR2G*  -  GNj^  -  N^^G  -  GN^  -  N2G')]  (50) 

subject  to  the  algebraic  constraint 

X  =  (A  -  GC  -  GCA)'  X  (A  -  GC  -  GCA)  +  W,  (51) 

is  given  by 

x(k+llk+l)  =  (A-G*C-G*CA)  x(klk)  +  G*y(k+l)  +  G*y(k)  +  (B-G*CB)u(k) ,  (52a) 

x(0|0)  »  z^,  ^52b) 

where  W  is  asymmetric  positive-definite  weighting  matrix,  and 
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(53) 


[G*  !  G*]  =  (ASC'+N^  j  AEA'C'+N^)  j*^C  j 

with  Z  being  the  S3mimetric  positive-definite  solution  of  the  algebraic 
Riccati  equation 


Z(C’  I  A'C)  +  [r^  0  T 

0  R^ 


Z  =  AZA’  +  Q  -  (AZC'+N  •  AZA'C’+N, 


,)  rrclzCC  1  A'C')  +  0"jTY(A2C'+Nj^)  '  "1 

LcaJ  [o  rJ  (AZA’C'+N^)'] 


(54) 

Furthermore,  the  asymptotic  stability  of  [A  -  G*C  -  G*CA]  is  guaranteea  by  the 
complete  observability  or  detectability  of  the  pair  [A,  (C'  ■  A'C')']  and  the 
complete  reachability  of  the  pair  [A,  Qq]*  A 

The  optimal  gains  for  the  current-update  observer  (24)  and  the  one-step 
predicting  or  Luenberger  observer  (25)  can  be  obtained  in  a  similar  procedure 
by  setting  either  G=0  or  G=0.  The  results  are  summarized  below. 

Corollary  3;  Suppose  system  (l)  is  completely  observable  or  completely 
detectable.  Let  Q  =  Q2»  ^2,  and  R2  be  as  specified  for  (41)  and  let  G=0 
in  (41).  Then  the  optimal  full-order  current-update  observer  for  (l),  in  the 
sense  of  minimizing  the  performance  measure 


J  =  tr(X(Q2+  GR2G'  -  GN^  -  N2^' ^ J » 


(55) 


subject  to  the  algebraic  constraint 


X  =  (A-GCA)'  X  (A-GCA)  +  W, 


(56) 


is  given  by 

x(k+llk+l)  =  (A-G*CA)x(klk)  +  (B-G*CB)u(k)  +  G*y(k+i), 

=  Ax(k|k)  +  Bu(k)  +  G*[y(k+1)  -  CAx(k|k)  -  CBu(k) ]  ,  (57a) 

x(0|0)  =  z  ,  (57b) 

‘  o 

where  W  is  a  symmetric  positive-definite  weighting  matrix,  and 

G*  =  (AZA'C'+N2)(CAZA'C'+R2)'^,  (58) 
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with  Z  being  the  symmetric  positive^def inite  solution  of  the  algebraic 
Riccati  equation 


Z  =  ASA'  -  (ASA'C'+N2)(CAZA'C'+R2)“UAZA'C'  +  ^2)'+  Q2. 


(59) 


Furthermore,  the  asymptotic  stability  of  (A-G*CA)  is  guaranteed  by  the  complete 
observability  or  detectability  of  the  pair  [A,CA]  and  the  complete 
reachability  of  the  pair  [A,Qq].  ^ 

Corollary  4:  Suppose  (l)  is  completely  observable  or  completely  detectable. 
Let  Q  =  Q]^,  and  be  as  specified  for  (41)  and  let  G-0  in  i.41). 

Then  the  optimal  full-order  one-step  predicting  or  Luenberger  Observer  for 
(l),  in  the  sense  of  minimizing  the  performance  measure, 


J  ~  tr[X(Q2^+  GRj^G*  ~  ""  Nj^G’)]. 


(60) 


subject  to  the  algebraic  constraint 


X  =  (A-GC)  ’  X  (A-GC)  +  Viy 


(61J 


is  given  by 


x(k+l  ik)  -  (A-G*C)x(k  I  k-1 )  +  Bu(k)  +  G*y(k)  , 

=  A  x(klk-l)  +  Bu(k)  +  G*[y(k)  -  CxCkIk-l)]  ,  (62a) 

xC0|-l)  =  ,  ^g2b) 

where  W  is  a  symmetric  positive-definite  weighting  matrix,  and 

G*  =  (ASC'+NiXCSC  +  (63) 


with  E  being  the  symmetric  positive-definite  solution  of  the  algebraic  Riccati 
equation 


S  =  ASA'  -  (ASC'+iN3^)(,CSC'+R]_)~((ASC'+Ni)'  +  . 


(64) 


Furthermore,  the  asymptotic  stability  of  [A-G*C] 
observability  or  detectability  of  Che  pair  [A,C] 
of  the  pa ir  [A, Q^] . 


is  guaranteed  by  the  complete 
and  Che  complete  reachability 

A 
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The  optimal  observer  gains  basicall}^  depends  on  the  solutions  of  the 
Riccati  equations  (54),  (59)  and  ( 6 4)  •  Numerous  techniques  for  solving  such 

equations  are  well  known  [30“33]. 


5.  RELATIONSHIPS  AMONG  THE  OPTLMAL  FULL-ORDER  OBSERVERS  AND  THE  KALMAi^-BUuY 
FILTERS 

The  objective  of  this  section  is  to  establish  useful  relationships  among 
the  optimal  full-order  observers  developed  in  Section  4  and  the  various 
Kalman-Bucy  filters. 

Consider  the  system 


x(k+l)  =  Ax(k)  +  Bu(k)  +  w(k)  ,  x(0)  =  x 

o 


( 65a) 


with  noisy  measurements  given  by 


y(k)  =  Cx(k)  +  v(k)  ,  (65b) 

where  x(k),  u(k),  y(k),  A,  B  and  C  are  as  given  in  (l),  and  w(k)  and  v(k)  are 
independent  zero-mean  Gaussian  white-noise  vectors  with  covariances  and 

respectively. 

A  set  of  steady-state  or  time-invariant  Kalman-Bucy  filters  are  given  by 
[3]  : 

True  filter: 


x(k+llk+l)  -  Ax(klk)  +  Bu(k)  +  L[y(k+1)  -  CAx(klk)  -  CBu(k)j  ,  (66a) 


x(0|0)=Zo  . 


( 66b  ) 


One-Step  predicting  filter: 


x(k+llk)  -  Ax(klk-l)  +  Bu(k)  +  K[y(k)  -  Cx(klk-l)]  ,  C67a) 

x(0[-l)  -  ,  (67b) 


where 


K  =  Afc'  (cfc'+R^)~l  , 
f  =  APA'  -  AFC' (crc'+R^)“^CrA'+  Qw 


(69) 


(70) 


where  P  is  the  steady-state  error  covariance  associated  with  the  one-step 
predicting  filter  (67).  Furthermore,  the  above  filters  (66)  and  (67)  are 
related  by  the  following  equations  [3]: 


^leasureme^t  update: 

x(k+l|k+l)  =  x(k+l|k)  +  L  [y(k+l)  -  Cx(k+llk)],  x(0|-l)=z  ,  (71) 

p  =  p  -  rc'(crc'+R^)-icr  .  (72) 

Time-update: 

x(k+l|k)  =  Ax(k|k)  +  Bu(k),  x(0[0)=z^,  (73) 

P  =  APA'  +  Q^.  ,  (74) 


where  P  is  the  steady-state  error  covariance  associated  with  the  true  filter 


_  to  be  verified  that  the  above  set  of  Ralinan“Bucy  filters  can  be 

derived  from  the  results  of  Section  4.  More  specifically,  the  aim  is  to  show 
that  Corollaries  3  and  4  for  the  optimal  deterministic  observers  can  be 
e()uivalent  to  the  steady— state  true  filter  and  one— step  predicting  filter. 
The  measurement-update  and  time-update  relationships  for  the  filters  and 
observers  will  also  be  established.  It  is  easy  to  see  that  Corollary  4  gives 
rise  to  the  one— step  predicting  filter  if  the  weighting  matrices  Nj^=0, 
^1  ”Qw  ®od  chosen  for  the  performance  measure  (60),  tor  then 

(63)  and  (64)  reduce  to,  respectively. 


G*  =  AEC'(CEC’+Rv)"l 
I  =  A2A'  -  AZc' (C2c'+R„)-1c2a'  +  Q„, 


which  are  structurally  and  numerically  equivalent  to  (69)  and  (70),  respectively, 
that  is,  G  =  K  and  Z  *  p.  On  Che  other  hand,  if  the  weighting  matrices 
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Q2  ■  Ql  “  Qw>  ^2  =  QwC'  and  R2  =  ^  chosen  for 

the  performance  measure  (55)  in  Corollary  3,  then  (58)  and  (59)  yield 

G*  *  (AZA'  +  Q„)C'[C(AZA'  +  Q^)C'  +  (77) 

Z  =  (AZA'  +  Q^)  -  (AZA'y  +0^)C' [C(AZA'  +  Q„)C'  +  R^]“lc(AZA’  +  Q^,)'.  (78) 


Denoting,  with  Qi  = 


Z  =  AZA'  +  Q^, 


(79) 


equations  (77)  and  (78)  become,  respectively. 


G*  =  ZC  (CZC'  +  Rv)"^, 

(80) 

Z  =  Z  -  ZC'(CZC'  +  Rv)~lcz  . 

(81) 

Substituting  (8l)  into  (79)  results  in 

Z  =  AZA'  -  AZC'(CZC'  +  R^)"1CZA'  +  Q^,  (82) 

where  it  is  seen  from  (  70),  (  76)  and  (82)  chat  i.  =  Z  “  f.  It  follows  from 

(68)  and  (80)  that  G*  =  L.  Furthermore,  since  (79)  and  (81)  are  identical  to 
(74)  and  (72),  respectively,  it  can  be  concluded  that  Z  =  F.  With  G*  =  L,  the 
measurement“update  equation  (71)  and  the  time~update  equation  (73j  for  the 
Kalman-Bucy  filters  are  equivalent  to  the  corresponding  optimal 

measurement~update  equation  (30)  with  G  =  G*,  and  the  time~update  equation 
(29).  A  complete  equivalence  between  the  optimal  deterministic  observers  and 
the  t ime“inva riant  Kalman“Bucy  filters  is  thus  established.  Finally  ^  ,  it 

is  interesting  to  note  that  the  use  of  Corollary  4  and  the  relationship  G  =  AG 

1x1  (32)  can  also  be  used  to  obtain  the  true  filter  (66)  from  the 

current“updaCe  observer  (24). 
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CONCLUSIONS 


The  development  of  a  class  of  deterministic  general  structured  observers  has 
been  considered.  From  this  class  of  general  structured  observers,  the  current- 
update  observers  and  the  one-step  predicting  or  Luenberger  observers  can  be 
derived.  Interesting  properties  relating  the  full-order  observers  are  pointed 
out.  Optimal  gains  for  these  full-order  observers  are  derived  using  an 
appropiate  performance  measure.  It  is  shown  that  these  optimal  deterministic 
observers  can  be  equivalent  to,  and  hence  may  be  as  effective  as,  the  steady- 
state  or  time-invariant  Kalman-Bucy  filters.  The  results  of  this  paper  have 
been  extended  to  include  reduced-order  observers  and  time-optimal  properties 
of  the  observers.  The  details  will  be  given  elsewhere. 
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APPENDIX  A:  Proof  of  Theorem  1:  [see  also  [34]] 


For  sufficiency,  it  is  clear  that  if  (7) -(10)  hold,  then  (6)  reduces  to 

e^^h+llk+l)  ®  Fe2(klk),  62(0)  “  ^zo> 
ex^klk)  =  Pe2(k|k), 


so  that  e,(k  I  k)  approaches  the  origin  asymptotically;  since  P  is  of  full 
rank,  ex^k)  also  approaches  the  origin  asymptotically. 

Conversely,  suppose  e2^k|k)  ~  z(k|k)  “  Tx(k)  0  and  ex(k|k)  ->  0  as 
k  for  all  z(0),  x(  O)  and  u(k),  k  =  0,  1,...  .  Then  setting  x(0)  =  0  and 

u(k)  “  0  establishes  the  necessity  for  condition  (a)  in  (6a).  On  the  other 
hand,  condition  (c)  must  hold,  for  otherwise  a  u(k)  would  always  exist  to 
drive  e2^k  |  k)  away  from  the  origin.  Likewise,  unless  condition  (b)  is 
satisfied,  the  reachability  of  (l)  ensures  that  there  would  always  exist  a 
x(k),  driven  by  u(k),  which  makes  e2^k  |  k)  ^  0  in  (6a).  Lastly,  e2(k  |k) 

0  and  ex(k  I  k)  0  as  k->-<“  yield  ex(k|k)  =  [PT  +  VC  -  Ip]x(k)  -»■  0 

for  arbitrary  x(k),  thus  establishing  the  necessity  for  condition  (d) .  ^ 
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ON  THE  DESIGN  OP  PREDICTORS  PORTFIRE  CONTROL  SYSTEMS 
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BACKGROUND 

The  design  of  fire  control  systems  has  been  characterized  by  a 
continuing  controversy  regarding  the  extent  of  estimation  and 
prediction  which  should  be  mechanized.  One  extreme  argues  that 
the  critical  process  is  the  tracking  process  which  viith  current 
technology  produces  such  large  errors  that  a  fire  control  system 
should  not  attempt*  to  lead  a  maneuvering  target.  The  other  ex¬ 
treme  argues  that  the  methodology  is  at  hand  to  deal  with  large 
observation  errors  and  model  identification.  Thus,  arbitrarily 
high  order  estimators  and  predictors  should  be  implemented  to  pro- 
jduce  a  lead  for  engaging  maneuvering  targets.  i 

Without  taking  an  a  priori  position  on  the  above  issue,  this  re¬ 
port  attempts  to  formalize  some  of  the  processes  which  may  resolve 
the  design  problem.  The  problem  is  viewed  as  a  trade-off  between 
the  errors  incurred  in  propagating  incorrect  estimates  through  a 
predictor  versus  the  errors  Incurred  in  failure  to  utilize  an 
available  estimate  of  a  target  state  variable.  The  problem  is 
similar  to  one  which  arises  in  numerical  analysis  wherein  some 
situations  "cancellation  errors"  may  be  larger  than  a  computed 
result . 

The  design  of  predictors  will  be  considered  in  the  situation  where 
the  predictor  exists  in  tandem  with  a  Kalman  estimator.  This 
introduces  no  loss  of  generality,  and  yet,  it  is  an  acknowledge¬ 
ment  of  the  current  state  -of-the-art.  This  structure  does 
permit  a  presumption  that  state  estimates  and  their  variances 
are  available  for  this  analysis.  The  Kalman  estimator  would 
employ  models  of  the  form. 

Target  Model 


^k+1  ~  ‘{’k^k'^Bk^k 
Observation  Model 


Yk  =  HkXk+Vk 


The  Kalman  estimator  would  produce  a  sequence  of  estimates,  Xjc, 
and  variances  of  the  estimation  errors,  P^.  If,  at  some  time, 
it  is  necessary  to  predict  ahead  N  steps  without  observations, 
the  best,  linear,  unbiased  estimator  is 
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v/hlch  would  be  approximated  by 


^  M  A 

Vn  ■  (♦k>"  =< 


k 


ThG  variEncG  of  tho  estimation  error  at  k+N  is  obtained  from  the 
recussion 

^k+i "  ^k^k^k’^  ^kVk’ 

where  Qj^  i  £(11^11^^').  Extrapolating  this  to  k+N  gives 

fk+N  =  ♦k''''k(*k'')  ^j^k-' 


which  will 


be  viewed  as 

Pk+N  =  i 


composed  of 
Propagated  variance 
of  the  estimates 


+ 


Error  variance 
Induced  by 
subsequent  tar¬ 
get  motion 


So  far,  this  discussion  seems  to  indicate  that  the  design  and  analy¬ 
sis  of  the  predictor  of  is  a  closed  issue  and  that  the  vari¬ 

ances  are  left  as  a  take— it— or— leave— it  result.  However,  there 
is  a  choice  to  be  made  if  one  recognizes  that  the  model  is  only 
one  of  a  number  of  possible  models.  Furthermore,  even  if  the 
models  are  correct,  there  may  be  a  better,  biased  predictor.  To 
formalize  this  notion,  a  predictor  is  defined  as 

/V  *  A 

\+n  "  ***  k\ 

where  is  not  necessarily  i{i  The  loss  function  of  Interest 

is  the  norm  of  the  prediction  error 


ilEpIl  =  liXic+N-Xk+N » 

and  the  risk  (which  is  to  be  minimized)  will  be 
R(<|)*)  =  ELnEpll] 

that  is,  we  will  seek  that  predictor  algorithm,  <{)*,  which  mini¬ 
mizes  the  expected  norm  of  predictor  error. 

Although  the  problem  just  formulated  is  of  interest  in  itself, 
there  is  an  equally  interesting  dual.  That  is,  given  a  maximum 
allowable  risk,  what  predictor  algorithm  will  lead  to  the  best 
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overall  estimator-predictor  performance?  Here,  performance  is 
viewed  in  terms  of  constraints  placed  upon  sight  technology  and 
target  agility.  The  least  constraining  design  will  be  viewed  as 
the  best  design. 

FIRE  CONTROL  SYSTEMS  AS  A  SPECIAL  CASE 

The  problem  definition  stated  in  the  previous  section  leaves  a 
large  number  of  parameters  to  be  considered  in  the  general  case. 

To  illustrate  the  utility  of  the  approach,  the  subsequent  discus¬ 
sion  will  be  limited  to  situations  which  arise  in  the  design  of 
fire  control  systems.  The  following  limitations  will  be  presumed: 

Only  the  positional  component  of  the  predictor  error  will  be  in¬ 
cluded  in  the  loss  function. 

The  model  to  be  employed  in  the  estimator  is  of  the  form 

1  At  At2/2.  ..." 

1  At...  ;  ~ 

0  1  . 


Hk  =  [100...  0] 


The  prediction  algorithm  will  be  of  the  form 

Position  '  Vf  •  •  •  •] 

Where  tf  =  NAt,  and  the  yv,YA>  etc.,  are  the  optimization 
parameters. 

The  case  of  greatest  interest  in  current  applications  is  the  one 
with  a  3-dimenslonal  state  vector 

At^/ 2 

At  ;  B  = 

1 

H  =  [10  0] 

Here,  the  state  vector  is  the  position,  velocity,  and  acceleration 
along  a  single  axis  of  the  target 
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The  prediction  problem  along  each  axis  of  the  target  can  be  treat 
ed  independently  for  design  purposes.  In  real-time,  however, 
one  would  expect  inherent  coupling  of  the  state  equations  along 
the  respective  axes.  In  other  words,  we  will  be  generating  up 
to  three  designs  to  be  coupled  together  in  a  real-time  Implementa 
tlon. 


Concentrating  upon  a  single  coordinate  axis,  the  loss  function 
may  be  defined  as 

.p2  -  (?  - 

Here  Ilell2  is  used  as  a  loss  function  for  conciseness.  In 
turn,  the  risk  is 

R(<fr*)  =  E(ep2) 

Using  the  definitions  of  x  and  x  as  well  as  the  notation 

T  =  [1  tj  =  [♦'‘^position 

T  =  [1  Tfytp  =  [♦  ^Position 

the  risk  condenses  to 

E(e  2)  =  T*Pj^T*'+T*Zj^(T*-T)  ' 

+  (t*-t)z\t*' 

+  (T*-T)E(Xi^Xj^' )  (T*-T)  ' 

+  E  ( s^TI ^ 

where  =  E[(Xj^-Xjj)x 

and  ECe^rpj)  =  Variance  of  the  target  Induced  error 

Thus,  the  risk  is  composed  of  terms  which  are  Interpreted  as 

E(ep2)  =  Variance  propagated  from  the  estimation  errors 

+  I  Correlations  between  estimation  errors  and  states 
1  (2  terms) 

+  (Mean  squared  magnitudes  of  the  target  states  after 
\ propagation 
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+  Error  variances  induced  by  subsequent  target  motion 

Since  the  last  term  is  not  a  function  of  the  optimization  parame¬ 
ters,  it  will  be  treated  as  a  constant.  The  optimization  involves 
a  trade-off  between  the  other  terms.  Thus,  a  new  risk  is  formu¬ 
lated  as  the  residual  risk,  E(ej^2),  due  to  the  choice  of 
the  prediction  algorithm. 

E(ej^2)  =  E(ep2)  -  E(e2^j) 

In  the  first  three  terms  of  the  risk  equation,  the  filter  parame¬ 
ters,  Pit,  may  be  determined  as  ratios  Pjc/q  ^-^d  Z^/q  at  steady 
state.  These  ratios,  as  well  as  r/q,  are  functions  of  a  single 
bandwidth  parameter,  u  ,  for  the  target  velocity.!  (Note 
that  q=  E(u2);  r=  E(v2y).  After  dividing  the  risk  equation 
by  q,  and  gathering  terms,  the  dual  optimization  problem  becomes 
one  of  maximizing. 

K(u)v)  •N(S,tf  ,Yv>Ya) 

r/E(e  =  - 

D(uv,tf , yv,YA) 

S  =  E(XX' )/E(e^2) 

N(S,tp.,Y^,Y^)  =  y2+S22(tp.)2.x2 

+  2S231 (tf^/2) ‘Xy] 

D(u)^,tp.,Yv,YA)  =  T  (P/q)T’-  [  (P33/q) .  ( tf.2/2)2y2 

+  (P22/q) (tf )^*x^+2(P23/q) (tf2/2) .xy] 

v/here  x  =  1-Yv  and  y=l-YA 
KCwy)  =  r/q 

At  the  steady  state  design  condition,  the  cross  correlations,  Z/q, 
are  precisely  the  negative  of  the  variance  ratios,  l.e., 

Z/q  =  -P/ q 

Given  the  target  characteristics,  specified  in  rough  terras  by  S 
and  oiy,  and  the  maximum  allowable  risk,  the  designer  must  choose 
Yv  and  ya  to  maximize  r.  Maximizing  r,  in  turn,  maximizes 
q  since  q/r  is  a  constant.  Thus,  the  estimator-predictor  combina¬ 
tion  is  least  constraining  in  respect  to  the  sight  technology 
and  target  agility. 

The  final  form  of  the  performance  criterion  is  a  rational  function 
in  X  and  y.  Further  direct  analysis  of  the  extrema  of  this 


47 


function  is  possible,  but  some  preliminary  insights  may  be  possi¬ 
ble  by  taking  cross-sections  of  the  contours  with  respect  to 
coordinate  planes.  This  corresponds  to  examining  single  variable 
perturbations  about  the  standard  first  and  second  order  predictors. 

SOME  SPECIAL  PREDICTOR  DESIGNS 


Two  interesting  special  cases  offer  some  Insight  into  the  dual^ 
optimization  of  the  predictor  design.  In  each,  one  of  the  design 
parameters  will  be  set  to  its  natural  value  (i.e.,  i=l). 


r/E(er,2)| 

y=0 


K(a»^)Cl~S22(tf)2x2] 

T  (P/q)T'-(P22/q)(tf)^x^ 


r/E(e  i 

x=0 


K(a)^)[l-S33(tj.2/2)2y2] 

T  (P/q)T'-(P33/q)(tj.2/2)V 


Both  of  those  functions  are  of  the  same  general  form,  and  both 
are  symmetrical  about  the  origin  of  the  free  variable  (i.e., 
symmetrical  about  y=1)*  The  general  form  is 


where 


[l-a(tf.,S)x2] 


VE(ep2)  - 

C(a)  ) . - 5- 

[l-b(tf.,a)v)x^] 

It 

3 

O 

r/q 

T  (P/q)T’ 

a(t„  S)  = 

^li*  l^^f^”^^(l-l) 

b(tj.,a)^) 

=  [(Pil/q)/(T(P/q)T’)]^Ct 

This  function  is  a  ratio  of  parabolas  whose  shape  is  determined 
by  the  relative  values  of  a(tp^S)  and  b(tp,t»)y).  Three  shapes 
can  occur  as  shovin  in  Figure  1.  The  numerical  values  are  taken 
from  the  next  section.  Thus,  the  offset,  x,  represents  a  typical 
velocity  correction  and  y  represents  a  typical  acceleration 
collection. 


ANALYSIS  OF  A  NOMINAL  CASE 

As  an  example  of  the  design  process  developed  in  the  previous 
section,  consider  the  case  of  a  target  described  by 

Wv  =  0.15  hz  (velocity  bandwidth) 
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tf  =  1.5  sec  (time  of  flight) 

Referring  to  the  estimator  design  procedure  in  Reference  1,  the 
variance  ratios  are 

(P/q)3  3  =  .106103 

"  2I3  =  .056289 
"  1,2  =  .015796 

”  1,3  =  .01il217 

’•  2,2  =  .0^14333 

”  1,1  =  .008765 

(r/q)  =  .028993 

which,  in  turn,  specifies 

r/q 

C(a>  )  =  - =  0.0563 

T(P/q)T 

The  optimization  is  done  with  respect  to  a  maximum  allowable  risk 
which  will  be  taken  to  be 

E(ep2)  =  (1.15)2m2 

The  levels  of  the  target  state  variables  will  be  nominally  assessed 
at 


E(V2)  =  2  m/sec 
■^E(A^)  =  0.4  m/sec2 

Utilizing  the  definitions  in  the  previous  sections 


a( tf ,S) 


S22(tf2)  =  6.80  I 

S33(tf2/2)2  =  .153  ’ 

(P22/q>(tf2)/T(P/q)T'  =  .199 

(P33/q)(tf2/2)2/T(P/q)T’=  .261 


b  ( tf ,  ujv ) 


The  dimensionless  optimal  designs  are  characterized  in  each  case 

by 


aCtfjUy)  >  b(t,a)v)  for  y=0 
a(tf,a)y)  <  b(t,ajy)  for  x=0 


Thus,  the  best  value  of  Yv  is  1.0  while  the  worse  value  of  ya 
is  1.0.  One  possible  predictor  is 

T*  =  [1  t^]  (l.e.,  first  order) 
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That  is,  let  Yv=1-0  and  Ya=0-  The  performance  Is  character¬ 
ized  by 


r/E(tpj2)  =  .0646 


versus 


r/E(e  p2)  =  c(.>^)  =  .0563 
for  the  natural  case  of 

T*  =  [1  t^-.  t^2/2-j 

It  must  be  noted  that  this  is  only  one  of  a  myriad  of  such  results. 
This  one  is  not  surprising  in  light  of  the  low  level  of  accelera¬ 
tion  involved  (i.e.,  yf  E(a2)  =0.4  m/sec2).  Illustrating  the 
rest  of  the  design,  note  that 

=  ^0.06k6  .  E(e^^2)  =  .292  meters 

VT  =VTh/q)^*h  =  1.71  m/sec3 

These  are  the  maximum  tolerable  levels  of  observation  error  and 
target  agility  for  the  desired  miss  variance,  E(e^‘=^).  Thus, 
not  only  is  the  optimal  predictor  algorithm  found,  but  technolog¬ 
ical  constraints  are  also  available  from  the  design. 

UNREALIZABLE  DESIGNS 

It  is  evident  from  tlie  general  analysis  of  the  optimal  predictor 
designs,  that  some  of  the  parameter  values  lead  to  negative  values 
of  r/E(e„2).  One  could  argue  that  these  are  never  cases  of  in¬ 
terest  since  they  do  not  occur  in  the  neighborhood  of  the  natural 
parameters  vrhen  a<b,  and  when  a>b,  the  natural  value  is  clearly 
optimal.  Nonetheless,  an  interpretation  of  the  results  of  when 
r/E(e  j^2)  is  negative  is  called  for  since  the  parameter  values 

are  realizable. 

Since  the  desired  risk,  E(epj2),  is  asserted  a  priori,  the  case 
of  negative  r/E(co^)  corresponds  to  situations  where  that  risk 
is  not  achievable  by  any  r,q  combination.  This  may  arise  because 
either  the  magnitudes  of  the  states  are  too  large,  the  risk  is 
too  small,  the  time  of  flight  is  too  large,  or  the  variance  of 
the  estimation  errors  is  too  large.  Of  all  of  these  conditions, 
the  time  of  flight  is  the  most  pervasive  since  it  effects  both 
the  pole  and  zero  placement.  Thus,  the  whole  structure  of  the 
optimal  design  as  determined  by  the  relative  magnitude  of  a(ts,S) 
and  b(ts,a)v)  may  be  signif icantly  altered  by  the  time  of  flight 
(i.e.,  the  distance  to  the  target). 
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PREDICTING  COORDINATED  TURNS 


As  an  alternate  approach  to  the  analysis  of  propagated  errors, 
the  design  of  a  predictor  may  be  predicated  upon  some  particular 
maneuver  type.  The  maneuver  type  would  then  supplant  the  polyno 
mial  form  which  emerges  from  the  Cartesian  estimator.  One  very 
natural  model  for  the  subsequent  motion  of  the  target  is  the 
"steady  turn"  wherein  the  acceleration  vector  remains  fixed 
with  respect  to  the  target,  but  it  may,  of  course,  rotate  in  an 
inertial  frame.  The  predictor  model  is  formulated  as 


• 

X 

0 

0 

COS0 

0 

0 

0 

r 

X 

y 

0 

0 

sln0 

0 

0 

0 

y 

V 

0 

0 

0 

1 

0 

0 

V 

At 

0 

0 
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0 
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0 
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At 

1 

0 

0 

0 

0 

0 

0 

0 

V 

_  Al 

0 

0 

0 

0 

0 

Al_ 

where  x  and  y  are  the  target  coordinates  in  an  inertial  frame,  V 
is  the  total  speed  of  the  target.  At  and  Aj,  are  the  longitudinal 
and  lateral  acceleration  respectively,  and  9  is  the  orientation 
of  the  target  motion  (l.e.,  Vx  =  Vcos0). 

Given  fixed  At  and  Ag,  the  V  and  0  components  may  be  integrated 
over  the  time  of  flight. 

V(t+tf)  =  V(t)  +  ATtf 

9(t+tf.)  =  0(t)  +  —  log  (V(t+tf.)/V(t) ) 

At 

The  positional  Integrals  are  approximated  by  "stepping”  9  and  V. 


N-1  1 

x(t+tf.)  =  x(t)  +  ( cos9 )  At+— 

i=0  2 


N-1 

At^  I  [cos9. ( 2i+l ) ]Am 
i=0 


M-1  1 

y(t+tf.)  =  y(t)  +  I  (cos0  .V.  )At+_ 

1  =n  ^2 


N-1 

At2  I  [sln9^  (2i+l)]Arr. 
i=0 


where 


Vi  =  V(t)  +  ATAt 


0 


i 


0(t)  + 


_  log  (Vi/V(t)) 
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N  =  tf/At 

This  formula  for  x(t+tf)  and  y(t+tf)  Is  a  closed  form  predictor 
as  it  stands.  Given  the  time,  and  computer  resources,  this 
could  be  the  basis  for  a  "steady  turn"  predictor. 


A  more  efficient  version  of  the  predictor  may  be  discerned  by 
forcing  the  model  to  be  circular  in  form  (l.e.,  A-p  =  0).  This 
is  the  "coordinated  turn."  Prom  this  aproxirnation  we  get 

V(t+tp)  =  V(t)  =  V 

Al 

e(t+tf.)  =  e(t)  +  —  tf  =  0(t)  +  u)tj. 

V 

I  N-l 

X(t+tf)  =  X(t)  +  -  I  COSSj^Vtf 

N  j_=o 


yCt+tj-.) 


1  N-l 

y(t)  +  —  y  SlnS.Vtf. 

N  1=0  ^  ^ 


01  =  0(t)+luAt 

Not  only  is  this  a  simpler  and  more  readily  implemented  predictor 
but  it  is  also  beginning  to  show  some  of  the  same  form  as  the 
"optimal"  polynomial  predictors. 

In  order  to  Illustrate  the  similarity  between  the  circular  predic 
tor  and  the  polynomial  predictors,  the  trigonometric  terms  will 
be  expanded  about  the  starting  angle,  0Q=0(t). 

COS01  =  cos0oCOs(la)At)-sin0oSin  (iuAt) 

slnOi  =  sln0ocos(  ia)At)+cos0oSln(ia)At) 


(l(oAt)2 

then  cos(la)t)  «1-  _  +  o[(ia)At)A] 

2 


( iwAt ) 3 

sln(luAt)  =  iwAt  -  _ _ +  o  [(ia)At)5] 


If  the  first  two  terms  of  the  expansions  are  used,  the  prediction 
error  due  to  truncation  will  be  bounded  by 

lei  <  [(a)t^)V24]Vt^ 
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Assuming  that  this  bound  is  acceptable,  the  predicted  position 
is  further  approximated  by. 


where 


x(t+t^)  =  x(t)  +  -  Yy^sine^Aj^. tj.2/2 

y(t+t^)  =  y(t)  +  YyVytf.  +  Y^coseQALtj.^/^ 

1  -  __  _  J 

N-1  r  a)2tp2  |vj_i  1 
N  L  12  M  J 


which  for  large  enough  N  is 

a,2t  2 

Y  «[1 - L-  ] 

^  6 


^A 


<-2t.2 

•  [1 - L_] 

12 


Thus,  in  the  final  analysis,  the  coordinated  turn  provides  another 
rationale  for  the  choice  of  Yy  ^^^d  y^.  Together  with  the  pre¬ 
vious  sections,  it  is  now  possible  to  compute  the  propagated 
errors  for  the  circular  predictor  and  compare  them  with  the 
’’optimal"  forms. 


OPTIMIZING  SEVERAL  PREDICTOR  PARAMETERS. 


The  analysis  of  the  previous  section  raises  the  immediate  question 
of  what  performance  can  be  achieved  when  several  parameters  are 
varied  simultaneously.  To  illustrate  that  some  interesting 
shifts  in  the  points  of  optimability  may  occur,  consider  the 
situation  where  the  velocities  and  accelerations  are  uncorrelated. 


S23 

This  is  perhaps  an  extreme  condition,  but  it  is  not  too  far  from 
the  condition  observed  In  moving  vehicles.  Using  the  typical 
case  discussed  in  Section  4,  the  covariance  of  the  estimation 
errors  is 


^23/q  =  0.0563 
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which  substantiates  the  analysis  of  a  two  dimensional,  joint  opti¬ 
mization.  The  performance  contours,  in  this  case,  show  a  slight 
degree  of  assymetry.  The  performance  function  is 

2  _  /  \  1.-6. 8x2-. 153y^ 

Y  /E(ff^  )  ^ -P(P/q)T' /  1-. 199x2-2(0. 184)xy-.26ly2 

In  the  vicinity  of  x=0,  y=l  (the  best  single  parameter  variants) 
the  multi-parameter  optimization  involves  a  balancing  of  the 

6.8x2  in  the  numerator 

and  2(0. 184 )x  in  the  denominator 

The  value  of  x  must  be  on  the  order  of  10“3  in  order  to  show  an 
improvement  in  performance,  and  even  then  the  Improvement  is 
infinitesimal. 

Although  the  example  being  pursued  here  is  only  one  of  a  number 
of  cases  of  interest,  this  type  of  effect  of  P23  has  been  found 
to  be  quite  characteristic.  The  numerator  terms  usually  dominate 
the  performance  contour  in  the  vicinity  of  the  optimum.  The 
extreme  sensitivity  of  the  performance  in  this  region  should  be 
noted.  The  effect  is  even  more  accentuated  at  higher  velocities 
and  accelerations,  and  at  longer  times— of— flight.  The  phenomenon 
tends  to  suggest  the  need  for  over— design  of  the  sighting  mechanism 
to  overcome  these  sensitivities. 

For  purposes  of  Illustration,  the  coordinated  turn,  if  used  with 
the  nominal  conditions  of  Section  4,  would  lead  to 


=  .2  radian 


utf  ®  0.3  radian 

When  used  as  a  rationale  for  the  selection  of  offset  parameters, 
the  Yv  2ind  Ya  become 

Yv  =  0-985 

YA  =  0.9925 

which  leads  to  performances  that  are  barely  distinguishable  from 
the  nominal  second  order  case 

r/E(  ej^2 )  =  q.  0563 


(I)  = 


\/E(A2) 


|/E(V2) 
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USING  THE  COORDINATED  TURN  MODEL  TO  REDUCE  DESIGN  SENSITIVITIES. 

Given  the  results  of  the  analysis  for  the  nominal  case  In  the  pre 
vlous  section,  the  question  of  a  strategy  for  reducing  sensitivi¬ 
ties  arises.  In  order  to  Illustrate  an  alternate  approach  to 
design,  consider  the  nominal  case 

Uy  =  0.05hz  (a  lower  frequency  target) 

tf  =  3  sec 

and  suppose  that  the  velocity  and  acceleration  are  orthogonal 
(l.e.,  circular  motion)  with  magnitudes 

V  =  120  m/sec 

A  =  40  m/sec2 

the  required  accuracy  will  be  taken  to  be 

E(ep2)  =  25  m2. 

Suppose  prediction  Is  to  be  done  along  the  velocity  vector.  The 
first  order  predictor  performance  v/ould  be  characterized  by 

1-5184x2 

r/E(G„2)  =  (.401)  _ 

^  1-. 246x2 

The  undesirable  sensitivity  Is  evident  In  this  case,  and,  thus, 

X  must  be  kept  very  precisely  at  zero.  The  required  sighting 
accuracy  would  be 

r  =  3.7  meters. 

If,  however,  the  circular  motion  Is  taken  to  be  the  nominal  behav 
lor,  then 


where 


a,2t  2 

1  (1-  - )tf. 

6 

0  1 


0)  =  A/V 


The  predictor  would  be  designed  by  selecting  deviations  from  the 
nominal  case,  l.e.. 


(1- 


a)2tp2 


■)  tf-Yy] 
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VJhere  Yv  is  the  offset  parameter  with  respect  to  the  coordinated 
turn 

In  this  case,  the  performance  is  characterized  by 

1-3600x2 

r/S(c  p2)  =  (0.447). - - 

^  1-156x2 

Not  only  is  the  sensitivity  reduced,  but  the  nominal  performance 
is  improved  since 

^  0)  2t2 

T(P/q)T'  =  - ^^f 

6 

<i)  2t2 

+  (P22/q)(l-  _ )2tj.2 


The  sighting  accuracy  can  now  be  relaxed  to 

r  =yo.43B  .yiTe^  =  3.31  meters 

The  corresponding  result  for  prediction  along  the  acceleration 
vector  are 

1-1296x2 

r/E(e„2)  =  (0.401)  . - 

^  1-. 145x2 

for  the  nominally  parabolic  model,  and 

1-1089x2 

r/E(e„2)  =  (0.4l4)  _ 

^  1-0.125x2 

for  the  coordinated  turn  model. 

In  the  context  of  this  work,  the  gross  difference  between  the  co¬ 
ordinated  turn  and  the  parabolic  motion  would  be  considered  to 
be  target  induced  error,  ^tI*  should  be  noted  in  passing, 

however,  that  such  errors  may  be  significant,  i.e. 

e^i  =  60  meters  along  the  velocity  vector; 

eTI  =  15  meters  along  the  acceleration  vector,  due 

to  the  Inappropriate  choice  of  predictor  model 
for  the  current  nominal  case. 
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CONCLUSIONS  AND  RECOMMENDATIONS. 

The  results  of  the  previous  sections  should  be  viewed  as  indica¬ 
tive  of  the  potential  screening  power  of  a  design  methodology 
based  upon  propagated  errors.  Critical  choices  in  predictor 
parameters  are  possible  and  performance  characteristics  are 
discernible. 

Given  the  pervasive  Influence  of  the  time  of  flight,  tf,  on  the 
results  obtained  herein,  one  must  conjecture  at  this  point 
about  the  real  time  features  of  the  methodology.  One  could  feed 
forward  smoothed  estimates  V  and  A  and  compute  the  optimal 
and  YA  during  an  engagement.  The  potential  gain  from  such 
a  scheme  is  so  great  that  it  deserves  some  testing  using  a  fire 
control  simulator. 

It  is  apparent  that  the  major  predictor  design  issues  may  vary 
greatly  with  the  intensity  of  the  target  motion.  For  larger 
leads,  the  Importance  of  the  correct  modeled  nominal  motion 
cannot  be  overemphasized.  From  one  model  to  the  next  there  may 
be  significant  differences  in  sensitivity  and  propagated  error 
as  well.  In  any  case,  for  large  leads,  there  is  seldom  an  oppor¬ 
tunity  to  use  other  than  the  natural  values  of  Yy 

On  the  other  hand,  for  less  intense  motion,  situations  may  arise 
where  propagated  error  completely  dominates  a  correction.  Whole 
terras  may  drop  out  of  the  predictor  as  a  result.  Here,  however, 
the  differences  between  nominal  models  is  not  so  significant. 
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Figure  1  (a)  Contours  of  Predictor,  Performance  Functions. 
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Figure  1  (b)  &  (c)  Contours  of  Predictor,  Performance  Functions. 
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ABSTRACT 

This  study  is  to  investigate  the  feasibility  and  performance  of 
an  adaptive  fire  control  filter-predictor  system  in  the  micro¬ 
processor  environment.  The  filter  model  used  is  our  previous 
design  which  includes  a  Kalman  estimator  for  tank  target  state 
estimation  and  an  UD- factorization  scheme  to  propagate  the  state 
error  covariance  matrix.  This  method  provides  excellent  computation 
stability  and  accuracy.  In  addition,  the  parallel  structure  of 
filters  in  this  model  is  particularly  suitable  for  microprocessor 
implementation.  Three  Intel  86712A  single  board  computers  are  used 
to  process  the  parallel  filters  simultaneously.  The  computation 
accuracy  of  the  target  state  estimates  and  the  processing  time  are 
examined. 


INTRODUCTION 

This  study  is  to  investigate  the  feasibility  of  implementing  and 
the  performance  of  an  advanced  adaptive  fire  control  filter-nre- 
dictor  system  in  real  time  and  in  the  microprocessor  environment. 
The  filter  model  used  is  our  previous  design  which  includes  a 
Kalman  estimator  for  tank  target  state  estimation  and  an  UD-fac- 
torization  scheme,  to  propagate  the  state  error  covariance  matrix. 
This  method  provides  excellent  computation  stability  and  accuracy. 
In  addition,  the  parallel  structure  of  filters  in  this  model  is 
inherently  suitable  for  microprocessor  implementation. 

We  start  our  exercise  with  the  UD- factorization  of  the  state  error 
covariance  matrix.  The  system  configuration  is  considered  next. 
Then  the  interface  of  microprocessors  is  described.  The  require¬ 
ments  of  implementation  are  stated.  Lastly,  the  results  and  con¬ 
clusions  are  addressed. 


UD- FACTORIZATION  OF  THE  STATE  ERROR  COVARIANCE  MATRIX 

In  the  conventional  Kalman  filter  algorithm,  the  error  information 
propagates  through  updating  the  state  error  covariance  matrix. 

They  are 
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where  P  is  the  a  posteriori  state  error  covariance  matrix,  P,  ^ 
the  a  priori  state  error  covariance  matrix,  $,  the  state  transition 
matrix,  H,  the  measurement  matrix,  G,  the  distribution  matrix, 

K,  the  Kalman  gain  matrix,  Q,  the  plant  noise  covariance  matrix, 
and  r,  the  measurement  noise  variance. 

The  P  matrix  is  required  to  be  positive  semi-definite  in  order 
that  the  system  is  stable.  As  we  may  see  in  Equation  (3)  that  the 
difference  of  tr^o  semi-definite  matrices  can  produce  a  negative 
definite  P  especially  when  the  accumulated  round-off  error  becomes 
significant.  Hence,  the  UD- factorization  method  is  adopted  for  its 
inherent  stability  and  enhanced  accuracy. 

The  recursive  formula  of  the  UD-algorithra  for  updating  the  state 
error  covariance  matrix  in  each  measurement  cycle  are 


WDX\r'^  =  (^) 

UDU^  =  WDWT  (5) 

oc  =  h’^UDU^H  +  r  (6) 

K  =  UDU'^H/oc  (7) 

UDU’^  =  u[p  -  ^(DU%)  (DU%)'^]u'^  (8) 

where  'D  =  D  j  0  is  N  by  N  and 

0  !  Q_ 

N  =  n+N^^,  W  =  [$U|gJ  n  by  N, 

G  =  1^-  0  n  by  N  , 
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n  by  n. 


rT  =  (hj^—  hn) 


D,  n  by  n  diagonal  matrix  and 

0 ,  zero  elements . 

_T  _ 

with  b=UH,  V'=DUH  and  initial  conditions 

-o/  '  -a/ 

=T+V]^b]^,  and 

^2  = 

where  Uj[^j  and  K^j  are  the  ith  element  of  the  colimin  vector  Uj 
and  Kj  respectively. 

Then,  the  elements  of  d,  U,  and  K  are  computed  recursively  as 
the  following: 


(9) 

(10) 

(11) 

(12) 

(131 


Since  't'  is  always  positive,  the  positive  definite  condition  of 
D  is  assured  by  Equation  (10) .  As  cancellation  type  errors  that 
may  happen  in  Equation  (3)  are  avoided  in  Equation  (10) ,  the 
accuracy  of  computation  is  enhanced. 
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In  our  previous  estimator-predictor  design,  three  types  of  target 
models  have  been  incorporated.  Type  1  is  a  constant  velocity  model. 
Type  2  is  a  first  order  Markov  acceleration  model  and  type  3  is 
a  second  order  acceleration  model  with  one  zero  and  two  poles. 

The  parameters  of  these  models  have  been  identified  with  real 
test  data  to  account  for  various  target  maneuvering  levels.  Noised 
corrupted  data  of  range  and  azimuth  angle  of  target  are  measurement 
inputs  to  the  three  parallel  extended  Kalman  filters  modified  with 
the  UD- factorization  scheme.  The  adaptive  prediction  comes  in 
when  one  of  the  filters  with  the  largest  likelihood  function  is 
selected  to  provide  estimates  for  gun  lead  prediction. 

Three  microprocessors  are  used  and  each  processes  one  filter.  The 
one  with  the  constant  velocity  model  is  the  master  board  which  also 
processes  the  filter  selection  and  gun  lead  predictions  by  virtue 
of  its  smaller  load  of  computational  burden.  The  communication 
control  and  data  transfer  between  the  master  board  and  the  other 
two  microprocessor  boards  will  be  described  in  the  next  section. 


MICROPROCESSORS  AND  THEIR  INTERFACE 

Three  Intel  86/12A  single  board  computers  (SBC)  are  used  to  process 
the  parallel  filters  simultaneously.  Each  has  32K  bytes  of  random 
access  memory  (RAM)  and  8K  bytes  of  electronic  programmable  read 
only  memory  (EPROM)  which  can  be  extended  to  32K  bytes .  If  more 
memory  is  needed,  extra  RAM  and  EPROM  boards  can  be  attached.  The 
memory  in  own  board  is  accessed  by  the  central  processing  unit 
(CPU)  of  the  board  through  the  local  bus .  Additional  memory  up 
to  one  megabyte  can  be  planned  and  accessed  through  the  system  bus. 

The  communication  between  the  keyboard  or  the  microprocessor  develop¬ 
ment  system  and  the  master  board  is  established  through  a  serial 
interface  cable.  Two  out  of  the  three  programmable  peripheral 
interface  input/output  ports  are  used  to  take  care  of  the  communi¬ 
cation  traffic  control  between  the  master  board  and  the  other  two 
boards . 

The  data  transfer  uses  the  multibus  interface  which  requires  only 
one  bus  clock  of  9.22  MHz  for  synchronized  communication  among  the 
SBC.  The  local  CPU  must  reside  in  that  part  of  its  OT-m  memory  which 
has  not  been  assigned  as  dual  port  RAM  inside  the  megabyte  addres¬ 
sing  plan  when  another  CPU  actually  accesses  the  dual  port  RAM 
area. 

The  86/12A  single  board  computer  which  has  twelve  16-bit  registers 
performs  floating  point  computations  with  the  help  of  a  floating 
point  mathematics  library  simulating  32-bit  operation.  The  test 
chip  of  8087  coprocessor  was  made  available  for  development  on 
August  1981,  This  coprocessor  which  can  be  attached  to  the  86/12A 
computer  board  easily  has  eight  80-bit  registers  capable  of  per- 
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forming  32-  and  64-bit  floating  point  multiplication  with  very 
high  speed  such  as  tens  of  microseconds . 


I^IPLEMENTATION 

After  the  proper  interfacing  of  the  microprocessors,  some  real 
target  paths  of  various  noise  statistics  are  selected.  The  function 
of  the  entire  set-up,  the  prediction  estimates  and  the  processing 
time  of  the  second  order  algorithm  in  this  system  are  investigated. 

a.  A  rather  linear  portion  of  a  target  path  is  used  to  verify  the 
proper  functioning  of  the  entire  microprocessor  set-up.  A  similar 
program  is  run  on  an  IBM  360  computer  to  obtain  results  for  com¬ 
parison. 

b.  A  segment  of  real  target  path  data  with  an  average  maneuvering 
noise  level  1.176  meters  per  second  and  an  average  speed  13  miles 
per  hour  is  corrupted  with  random  Gaussian  noise  of  3  meters  in 
range  measurement  and  0.3  milliradians  in  angle  measurement.  These 
corrupted  data  is  sampled  at  10  s/s  as  input  to  the  system  and  the 
prediction  estimates  are  compared  with  the  results  from  previous 
study  using  a  conventional  extended  Kalman  filter  v7ithout  modifi¬ 
cation. 

c.  With  the  same  data  the  system  performance  is  evaluated  for  pro¬ 
cessing  the  data  at  5  s/s  instead  of  10  s/s. 

d.  The  processing  times  for  the  floating  point  multiplication, 
division  and  square  root  operation  with  the  86/12A  CPU  are  compared 
to  that  with  the  8087  numerical  coprocessor. 

e.  The  actual  processing  time  of  the  second  order  algorithm  with 
a  8087  coprocessor  test  unit  is  examined.  The  number  of  measure¬ 
ments  processed  in  10  seconds  is  noted. 


RESULTS  AND  CONCLUSIONS 

Under  the  implementation  conditions  in  the  previous  section,  results 
are  summarized  as  follows : 

a.  Results  from  IBM  360  and  Intel  86/12A  SBC  show  a  1.2  percent 
or  less  difference  in  lead  angle  estimates  and  much  less  in  impact 
range  estimates.  The  proper  functioning  of  the  multi-microprocessor 
set-up  is  verified. 

b.  From  previous  study  for  the  given  target  path  in  implementation 
b  and  averaging  over  seventeen  points,  the  estimated  prediction 
errors  in  milliradians  are  1.29,  1.72  and  0.91  for  constant  velocity, 
first  order  acceleration  and  second  order  acceleration  filter  types 
respectively.  From  the  multi-microprocessors  under  the  same  con¬ 
ditions,  the  estimated  prediction  errors  in  milliradians  are  1.2, 

1.49  and  0.88  for  constant  velocity,  first  order  acceleration  and 
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second  order  acceleration  filter  types  respectively. 

The  accuracy  of  the  estimates  from  this  algorithm  are  highly  com¬ 
petitive  or  better  than  that  from  the  conventional  extended  Kalman 
filter  algorithm. 

c.  With  a  sampling  rate  of  5  s/s  the  estimated  prediction  errors 
in  milliradians  are  1.16,  1.54  and  0.9  for  constant  velocity,  first 
order  acceleration  and  second  order  acceleration  filter  types,  res¬ 
pectively.  The  change  in  performance  is  about  3  percent. 

d.  Averaging  over  ten  thousand  iterations,  the  floating  point 
operation  of  the  command  group  LOAD,  MUL,  STORE  and  WAIT  takes  4 
milliseconds  with  the  86/12A  CPU  while  the  same  takes  63.6  micro¬ 
seconds  with  the  8087  coprocessor.  It  is  a  63  times  faster  in 
multiplication  with  the  latter. 

For  the  command  group  LOAD,  DIV,  STORE  and  WAIT,  the  floating  point 
operation  takes  5.5  milliseconds  with  the  86/12A  CPU  and  85.6 
microseconds  with  the  8087  coprocessor.  It  is  64-<times  faster 
in  division  with  the  latter . 

For  the  command  group  LOAD,  SQP%.T,  STORE  and  WAIT,  the  86/12A  CPU 
takes  40  milliseconds  to  process  a  square  root  procedure  while  the 
8087  coprocessor  with  its  micro  program  for  SORT  takes  77.6  micro¬ 
seconds.  It  is  515  times  faster  with  the  latter. 

e.  With  a  test  unit  of  8087  coprocessor  planted  in  the  86/12A 
board,  the  second  order  algorithm  is  processed  for  10  seconds . 

Only  88  sets  of  measurements  are  processed.  This  indicates  that  a 
12  percent  improvement  in  speed  is  needed  to  do  10  samples  per 
second  real  time  processing.  Fine  tuning  the  program  code  or 
changing  the  programming  language  to  assembler  type  may  help  elim¬ 
inate  this  time  lag. 

In  all,  the  impressive  ntimerical  characteristics  of  the  UD-factor- 
ization  deserves  our  attention.  It  has  been  encouraging  to  know 
that  microprocessor  technology  has  caught  up  in  speed  and  flexi¬ 
bility  to  process  advanced  parallel  algorithms  with  heavy  load 
of  computation.  We  see  that  great  many  applications  of  micropro¬ 
cessor  to  multiple  input,  multiple  output  and  parallel  processing 
are  forthcoming . 
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ON  THE  CREDIBILITY  OF  MODELS 
Naim  A.  Khelr 

School  of  Science  and  Engineering 
The  University  of  Alabama  in  Huntsville,  Huntsville,  AL  35899 

Model  Credibility  is  becoming  increasingly  important  in  today's 
world  of  complex  systems.  One  reason  is  due  to  the  awareness  and 
discretion  being  exercised  on  spending  on  simulations;  a  second 
reason  might  be  the  fact  that  more  and  more  models  and  simulation 
results  are  being  used  in  decision-making.  Models,  whether  repre¬ 
senting  the  broad  spectrum  of  engineering  applications  (energy, 
power,  aerospace,  military  systems,  etc.)  or,  say,  biological 
systems  such  as  that  of  blood  circulation  would  require  the  same 
attention  in  terms  of  model  credibility.  More  recently,  reliance 
on  valid  models  has  become  a  necessity  in  areas  such  as  energy 
forecasting, (energy  models  and  their  integrity  have  been  the 
subject  of  many  congressional  hearings  since  1975).  Thus,  the 
ultimate  use  of  a  model  is  for  decision-making  and  its  final  eval¬ 
uation  is  in  terms  of  the  decisions  being  made. 

Three  major  areas  appear  to  be  the  focus  of  progress,  or  lack  of 
it,  in  arriving  at  a  higher  level  of  agreement  among  simulatlonists 
on  model  credibility  and  what  it  is  all  about.  These  areas  are 
related  to: 

(i)  Terminology 

(ii)  Issues 

(lii)  Qualifying  Measures 

The  question  of  Terminology  has  been  the  subject  of  many  previous 
efforts  by  the  Technical  Committee  on  Model  Credibility  (TCMC)  of 
the  Society  of  Computer  Simulation.  Ref.  (1)  summarizes  a  complete 
set  of  definitions  beginning  with  the  understanding  and  distinction 
between  Reality,  Conceptual  Model,  and  Computerized  Model,  and 
proceeding  to  introduce  the  important  concepts  of  Model  Verification 
and  Model  Validation.  The  document  also  introduced  statements  on 
model  qualification,  domain  of  applicability,  certification  and 
documentation.  The  definitions  adopted  by  the  TCMC  on  verification 
and  validation  are;  "Model  Verification:  The  substantiation  that 
a  Computerized  Model  represents  a  .Conceptual  Model  within  specified 
limits  of  accuracy.  Model  Validation:  The  substantiation  that  a 

Computerized  Model  within  its  domain  of  applicability  possesses  a 
satisfactory  range  of  accuracy  consistent  with  the  intended  appli¬ 
cation  of  the  model."  However,  all  too  frequently  in  process 
modeling  the  concepts  of  verification  and  validation  are  reduced, 
respectively,  to  simply  the  tasks  of  debugging  and  input-output 
comparison  of  model  and  system  data.  One  should  note  that  this 
terminology  was  arrived  at  with  the  intent  that  it  could  be 
employed  in  all  types  of  simulation  applications.  Thus,  adherence 
to  the  terminology  is  highly  recommended  to  facilitate  communica¬ 
tion  between  simulation  developers,  users  and  decision-makers. 
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For  example,  it  Is  simple  to  recognize  the  difficulty  in  communi¬ 
cation  between  a  mathematically-oriented  modeler  and  a  linguisti¬ 
cally-oriented  public  policy-maker . 

A  work  of  caution  is  in  order,  even  if  one  adheres  to  the  above 
terminology  and  arrives  at  a  validated  model,  one  may  be  tempted 
to  make  an  obvious  and  simple  change  in  the  model.  Since  it  is 
easy  to  err,  revalidation  of  the  model,  for  the  same  reasons  that 
validation  was  required  in  the  first  place,  is  necessary  whenever 
any  modification,  even  an  apparently  minor  one,  is  made.  Thus, 
once  having  demonstrated  the  model  validity,  it  should  be  con¬ 
sidered  INVIOLABLE.** 

In  order  to  be  able  to  judge  the  adequacy  of  a  model  for  its  in¬ 
tended  purposes,  the  TCMC  is  engaged  in  a  discussion  of  an  under¬ 
standing  of  important  Issues  including  these: 

(a)  The  real  purpose  of  a  credibility  measure  -  is  it  to  make 
a  decision  to  use  a  '’more-credible  model"  at  perhaps  greater  cost 
versus  a  "less-credible  model"  at  a  lower  cost?  Is  it  to  estab¬ 
lish  a  confidence  or  reliability  factor  for  the  model,  i.e.,  X% 
reliable  for  Y%  confidence? 

(b)  In  modeling  reality  in  terms  of  ultimate  simplicity  (a 
more  detailed  model  is  not  necessarily  a  better  model),  an  issue 

is  that  of  "simple"  versus  "complex"  models  and  cost  considerations. 
Simple  models  have  the  added  merit  of  closing  communication  gaps 
between  "non- technical"  decision-makers  and  modelers. 

(c)  Should  a  credibility  measure  be  ’’absolute"  defining  the 
agreement  of  dynamic  computer  simulation  model  (Computerized  Model) 
with  actual  system  (Reality),  or  "relative"  (model  versus  model)? 

(d)  How  to  keep  simulations  under  configuration  control  so 
that  credible  ones  remain  credible. 

(e)  The  absence  of  an  effective  grasp  of  how  to  achieve  par¬ 
simony  in  modeling  makes  it  virtually  impossible  to  develop  an 
efficient,  understandable,  credible  model  of  a  large-scale  system. 

Next  we  focus  on  the  real  question  of  which  Qualifying  Measure  (s) 
one  may  adapt  to  express  the  credibility  of  a  model.  Basically, 
the  objective  from  using  a  measure  is  to  quantify  how  well  a  model 
matches  the  performance  of  reality  being  modeled.  Comparison  of 
time  trajectories  from  the  model  and  the  instrumented  system  is 
often  used  (data  collected  from  an  actual  system  may  be  rare  and/ 
or  expensive  to  get  while  simulation  results  may  be  abundant). 

A  summary  of  measures  used  in  previous  studies  follows: 

(a)  Theil  Inequality  Coefficient  (TIC)  was  recommended  in 
regards  with  missile  systems  validation  (2)^ 

**  Personal  discussion  with  Dr.  Lester  H.  Fink,  of  Systems  Engi¬ 
neering  for  Power,  Oakton,  Virginia. 
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(b)  A  modified  TIC  was  applied  to  validation  study  with  sparse 
random  data  (6) , 

(c)  In  addition  to  TICr  correlation  coefficient  and  a  simi¬ 
larity  coefficient  were  recommended  in  Ref.  (3), 

(d)  In  a  recent  study (5),  cross-correlation  coefficient  and 
Bayesian  updating  were  suggested  in  a  credibility  investigation 
of  an  RF  environmental  model, 

(e)  Use  of  FFT  (Fast  Fourrier  Transform)  to  obtain  frequency 
domain  error  measure  of  a  trajectory  was  introduced  in  a  boiler- 
turbine-generator  system^  study(^), 

(f)  Also  in  Ref.  (4),  digital  filtering  technique  was  employed 
to  extract  frequency  bands  of  an  error  signal;  the  band-limited 
measures  of  signals  could  be  further  analyzed. 

(g)  Error  trajectory  could  be  qualified  in  terms  of  measures 
such  as  the  integral  of  Its  square  and/or  the  largest  absolute 
value . 
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A  LINEAR  NONLINEAR  CONTROL  PROBLEM 
Leon  Kotin 

Center  for  Tactical  Computer  Systems 
US  Army  Communications-Electronics  Command,  Fort  Monmouth,  NJ  07703 

INTRODUCTION 

Many,  perhaps  most,  of  the  ordinary  differential  equations  of 

applied  mathematics  are  second-order  linear  equations 
9  2 

d  x/dt  +a(t)dx/dt  +  b(t)x  =  0  or  their  nonhomo geneous  variants. 
Their  importance  is  indicated  by  the  famous  names  given  to  such 
equations:  Bessel,  Legendre,  Hill,  Hermite,  Airy  and  Laguerre,  to 
name  just  a  few.  By  an  elementary  transformation,  such  equations 
can  be  put  into  the  form  of  the  following  system 

x'  =  dx/dt  =  p(t)y,  y'  =  dy/dt  =  q(t)x.  (1) 

We  shall  consider  this  system  where  the  control  vector  (p,q)  belongs 
to  the  class  of  piecewise  continuous  vector  fimctions  of  t  with 
bounded  norm,  say 

p2  +  q2  =  1.  (2) 

We  shall  determine  that  control  for  which  a  nontrivial  integral 
curve  or  trajectory  initially  on  the  x-axis  reaches  the  y-axis  in 
least  possible  time.  We  shall  also  determine  this  shortest  time 
and  the  corresponding  geodesic  (x(t),  y(t)). 

We  remark  that  by  changing  the  time  scale,  we  can  see  that  no 
generality  was  lost  by  setting  the  boimd  of  the  norm  of  (p,q) 
equal  to  unity. 

This  paper  is  a  detailed  exposition  of  results  which  were  obtained 
in  collaboration  with  G.  Birkhoff  in  the  broader  context  of 
elliptic  autonomous  families  of  differential  systems;  these 
appear  in  concise  form  in  [2]. 
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The  title  of  this  paper  derives  from  the  fact  that  although  (1)  is 
a  system  of  linear  differential  equations  in  x  and  y  and  is  linear 
in  the  control  variables  p  and  q,  it  is  not  linear  jointly  in 
(x,y)  and  (p,q). 


THE  MINIMUM  TIME 

From  (1)  and  (2),  we  see  that  for  any  trajectory, 

2  .  2  ,2,2.  ,2/  2  <  1 

p^  +  q  =  x'  /y  +  y'  /x  =  1 


(3) 


or 

1/y^  +  (dy/dx)^/x^  =  (dt/dx)^.  (A) 

Consequently,  the  time  t  required  for  a  trajectory  initially  at  the 
point  (xQ,yQ)  to  go  to  (x,y)  satisfies 

t  =  P  (1/y^  +  (dy/dx)^/x^)'^dx.  (5) 

^o 

To  evaluate  this  integral,  we  introduce  the  modified  polar 
coordinates  ^  and  X  defined  by 

g2(|)  ^  j^2  ^  y2^  ^  ^  (5) 

whence 

X  =  y  =  leVd+x^)"*.  (7) 

We  note  that  (j)  and  X  are  related  to  the  standard  polar  coodinates 
r  and  0  by  the  equations  (|)  =  log  r  and  X  =  arctan  0. 

In  order  to  compute  the  integral  in  (5)  we  first  obtain,  by  the 
chain  rule, 

dx  =  e'^d+X^)'^^^  [(l+X^)d<}.-XdX]  , 
dy  =  e‘*’(l+X^)"^^^[X(l+X^)d<|)+dX]. 
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Then 


Furthermore,  from  (6)  and  (7) 

2 

dA  =  (x  dy  -  y  dx)/x 

=  e"**’(l+X^)^/^[(l+X^)(J.'  -X]'^  dx.  (10) 

After  replacing  these  in  (5),  the  integral  in  terms  of  ({>  and  X 
becomes  /F(x  ,  (j) '  )dx  ,  where 

=  X  (  x^+1 )  t  (  ^  ^+1 )  ^  ^  ^+1 )  <J> '  ^  +  2X(X^-1)(()'  +2X^]^.  (11) 

The  bracketed  quadratic  expression  in  41 '  is  a  minimum  when 

4.'  =  -X(X^-l)/(X^+l)(xVl).  (12) 

Substituting  this  into  the  integrand  F(X,4)')  and  integrating 
from  X  =  0  to  <»,  we  obtain  the  minimum  time  T  in  going  from 
the  x-axis  to  the  y-axis.  This  is  found  to  be,  through  the  use 
of  the  beta  function, 

T  =/o  dx/(x^  +1)^  =  r^(!5)/4TT^  =  1.85407...  (13) 

([1],  p.  258;  [4],  pp.  254,524).  We  shall  show  later  that  this 
value  is  actually  attained.  We  state  this  as  Theorem  1: 

The  smallest  t-interval  between  a  zero  of  y  and  zero  of  x  for  a 
nontrivial  solution  (x,y)  of  (1)  -  (2)  is  given  by  T  in  (13). 

THE  GEODESICS  AND  CONTROLS 

By  definition,  the  geodesics  are  the  solution  of  (1)  -  (2)  which 
yield  the  minimum  time  T  in  (13).  This  time  T  is  realized  by  the 
trajectories  of  (l)-(2)  which  satisfy  the  differential  equation 
(12).  Although  this  equation  is  separable,  solving  it  in  its 
present  form  involves  some  complicated  computation.  It  is  easier 
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to  rewrite  (12)  in  terms  of  x  and  y  alone.  Using  (6),  (9)  and 

3  3 

(12),  after  some  simple  algebra  we  obtain  dy/dx  =  -x  /y  from 
(12);  this  integrates  to  the  implicit  form  of  the  solution 

x^  +  y^  =  const.  (14) 

By  linearity,  we  may  take  const.  =  1,  getting 

x^  +  y^  =  1.  (15) 

To  obtain  the  explicit  form  of  the  solution  (15),  we  first  invoke 
the  bang-bang  property;  i.e. ,  the  minimizing  solution  will  corre¬ 
spond  to  a  control  (p,q)  which  is  on  the  boundary  of  the  control 
region  in  the  (p,q  )  -plane.  Then  the  inequality  (2)  becomes  the 
equation 

p^  +  q^  =  1.  (16) 

3  3 

Differentiating  (14)  with  respect  to  t,  we  get  x  x'  +  y  y'  =0 
or,  using  (1),  px^  +  qy^  =  0.  But  from  (14)  and  (16),  this  yields 

p2  =  y^  (17) 

whence  x'  =  py  =  +(l-x^)^^^. 

We  shall  consider  the  trajectory  (15)  which  is  moving  clockwise 
and  which  is  initially  on  the  line  y  =  x  and,  with  no  loss  in 
generality,  in  the  first  quadrant,  so  that  x^  h  x(0)  =  2  Then 

x'  =  -d-x^)^/^  (18) 

whence 

-t  =  f  ==  (l-xS-3/''dx.  (19) 

’'O 

To  evaluate  this  integral  in  closed  form,  we  introduce  a  series 
of  substitutions  which  telescope  to 

x''  -  >5(1  -  (l-w^)"*).  (20) 


76 


Then  (19)  becomes 


-2  (w(l-w^)  )“'^dw.  (21) 

In  terms  of  Legendre's  canonical  form  [3,  p.  59] 

F((())  H  F((l),2’^)  =  (1  -  J5sin^v)“'^dv,  (22) 

t>y  [3,  pp.  78,  163]  (20)  becomes 

.2-y\  .  2^(F(*)  -F(./2))  -  2‘*/w2<l  -  %=ln^v)-’‘dv  (23) 

where 

w  =  (1  -  cos^(i))/(l  +  cos^4)) 

=  (1  -  cn^F((t)))/(l  +  cn^F((^))  (24) 

[3,  pp.  59,  163]  in  terms  of  the  Jacobian  elliptic  faction  cn  F  = 
cn  (F,2"^).  From  (23),  F((}))  =  -2t  +  F(Tr/2).  Letting  K  e  F(Tr/2), 
by  the  addition  formula  [4,  p.  497]  for  cn(u-v)  in  terms  of  the 
other  Jacobian  elliptic  functions  sn  and  dn,  and  by  the  values  sn  K 
=  1,  cn  K  =  0  and  dn  K  =  2”'^  [4,  p.  499],  we  can  show  from  (24) 
that 

w  =  cn^(2t).  (25) 

Applying  this  to  (20)  gives  us 

=  Js[l-(l-cn^(2t))^] .  (26) 

Then  by  (14)  becomes 

y^  =  i5[l+(l-cn^(2t))^].  (27) 

The  trajectory  given  by  these  two  equations  is  initially  on  the 
line  y  =  X  in  the  positive  quadrant  and  reaches  the  y-axis  when  t 
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is  the  smallest  positive  zero  of  cn(2t,2  '®);  this  is  =  r^(!^)/8Tr'^ 

[4,  p.  524]  (cf.  eq.  (13)  above).  To  determine  the  entire  tra¬ 
jectory  in  the  first  quadrant,  we  simply  reflect  this  trajectory 

through  the  line  y  =  x.  Therefore,  because  cn(2t)  is  an  even 

4  4 

function,  for  -T  <  t  <  0,  x  and  y  are  given  by  the  right-hand 
side  in  (27)  and  (26)  respectively. 

The  corresponding  control  variables  p  and  q  can  now  be  obtained 
immediately.  From  (17),  p  =  +y  ,  but  the  negative  sign  is  chosen 
by  (18);  similarly,  q  =  x  when  y  >  x  >  0. 

This  is  all  summarized  in  the  following  result,  which  incorporates 
Theorem  1. 

Theorem  2.  The  smallest  t- interval  between  a  zero  of  x  and  a 

2  "k 

zero  of  y  for  a  nontrivial  solution  of  (l)-(2)  is  r  (Js)/4it  = 
1.85407  ...  .  The  corresponding  geodesic,  normalized  by  (15), 
and  control  functions  are  given  by 

x2  =  q  =  [k(l  -  [l-cn^(2t)]^)]^ 

y2  =  .p  =  [jgd  +  [l-cn^(2t)]^)]''^ 

when  y  -  X  -  0. 
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ABSTRACT 

Application  of  modern  estimation  and  control  techniques  to  the 
tracking,  stabilization,  and  pointing  functions  of  a  mobile  air 
defense  system  is  addressed  in  this  paper.  A  multiple-model 
estimator  structure  is  applied  to  tracking  tactical  missile 
targets.  Modeling,  design,  and  performance  evaluation  of  an 
instrument  platform  stabilization  system  to  reject  terrain- 
induced  errors  dre  outlined.  An  optimal  predictor  for  ballis¬ 
tic  lead  angle  compensation  is  formulated;  its  prediction  error 
characteristics  and  propagation  of  these  errors  through  the  gun 
pointing  controller  are  derived.  Future  areas  of  development 
and  field  test  validation  for  the  concepts  investigated  are 
described. 


INTRODUCTION 


BACKGROUND 

Performance  requirements  of  modern  mobile  air  defense  systems 
have  placed  stringent  specifications  on  the  tracking,  stabili¬ 
zation  and  pointing  performance  of  these  systems.  The  tracking 
subsystem  must  accurately  estimate  the  position  and  velocity 
of  a  highly-maneuvering  target  without  explicit  knowledge  of 
the  target's  maneuvering  capabilities  or  intended  trajectory. 
The  stabilization  system  must  reject  vibration  disturbances  to 
the  tracking  sensor  assembly  and  gun  pointing  system  due  to  gun 
recoil  and  vehicle  travel  over  a  wide  range  of  terrain.  The 
gun  pointing  system  is  limited  by  the  extrapolation  accuracy 
of  a  target  trajectory  predictor  and  by  the  dynamic  capabili¬ 
ties  of  the  gun  gimbal  actuation  system.  These  demanding  sys¬ 
tem  requirements  call  for  correspondingly  powerful  design  and 
analysis  tools. 

Modern  control  methodologies  offer  a  systematic  framework  of 
synthesis  and  analysis  techniques  for  air  defense  fire-control 
system  design.  Modern  estimation/control  techniques  were  de¬ 
veloped  specifically  for  multiple  input-output  systems  which 
potentially  are  stochastic,  nonlinear,  and  uncertain  (i.e.. 
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the  dynamics  of  the  system  and  disturbances  are  not  known  pre¬ 
cisely);  all  of  these  considerations  are  present  in  an  air 
defense  fire-control  system. 

The  goal  of  the  study  described  here  was  to  assess  the  poten¬ 
tial  benefit  of  using  modern  control  techniques  in  designing  a 
fire-control  system  for  a  representative  air  defense  system. 
Using  mathematical  models  of  the  Gun  Low  Altitude  Air  Defense 
System  (GLAADS),  a  number  of  modern  estimation  and  control  con¬ 
cepts  were  developed  and  analyzed.  The  concepts  investigated 
and  a  summary  of  results  are  described  in  the  following  section. 

SYSTEM  OVERVIEW 

Figure  1  is  a  block  diagram  of  the  air  defense  fire-control 
system  concept  developed  in  the  present  study.  For  purposes 
of  demonstrating  new  estimation  and  control  concepts  the  system 
is  restricted  to  planar  engagements,  i.e.,  only  range  and  ele¬ 
vation  pointing  and  tracking  are  considered. 

The  integrated  system  is  comprised  of  tracking,  pointing,  and 
stabilization  subsystems.  The  tracking  subsystem  generates 

estimates  of  the  target  range  (R),  elevation  (6^.)  and  elevation 


R-4S321 


STABILIZATION 


TRACKING 


POINTING 


Figure  1  Integrated  Fire-Control  System 
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rate  (6^.)  from  noisy  range  and  elevation  measurements.  The 

estimates  generated  by  the  tracking  subsystem  are  routed  to 
the  pointing  system  where  these  quantities  are  extrapolated 
through  the  estimated  projectile  time-of- flight  (T^)  by  a  pre¬ 
diction  algorithm  (i.e.,  the  ballistic  lead  angle  is  computed). 
The  extrapolated  elevation  and  elevation  rate  are  then  used  by 
a  feedforward  controller  to  command  the  gun  pointing  angle  and 
slew  rate;  the  dynamic  response  of  the  gun  itself  is  regulated 
by  a  feedback  controller.  The  instrument  platform  stabiliza¬ 
tion  system  serves  the  dual  function  of  slaving  the  instrument 
platform  attitude  to  the  current  target  elevation  (via  feed¬ 
forward  control)  and  rejecting  terrain  induced  disturbances 
(via  a  feedforward- feedback  controller). 

Modern  control  concepts  applied  to  the  three  subsystems  and 
evaluated  in  the  present  study  include: 

•  A  tracking  filter  based  on  multiple 
model  estimator  structure 

•  Statistical  linearization  of  the  non- 
.  linear  target  tracking  dynamics 

•  An  optimal  one-step  prediction  algo¬ 
rithm  for  extrapolating  target  motion 
estimates  over  the  projectile  time- 
of-  flight 

•  Command-generator- tracker  (CGT)/optimal 
regulator  control  of  the  gun  pointing 
dynamics 

•  A  robust  (i.e.,  insensitive  to  varia¬ 
tions  of  terrain  type)  instrument  plat¬ 
form  stabilization  system  designed  by 
optimal  estimation/control  techniques. 

Applications  of  these  concepts  to  the  GLAADS  vehicle  and  per¬ 
formance  benefits  derived  from  them  are  discussed  in  the  fol¬ 
lowing  sections. 


TRACKING  FILTER  SUBSYSTEM 

Through  application  of  multiple-model  structuring  and  statisti¬ 
cal  linearization,  an  accurate  and  robust  tracking  filter  is 
designed.  This  filter  produces  unbiased  estimates  of  the  tar¬ 
get  motion  with  range  and  elevation  errors  substantially  lower 
than  the  noise  levels  of  the  range  and  elevation  sensors.  It 
is  shown  that  the  multiple  model  filter  obviates  the  need  for 
precise  knowledge  of  the  target  maneuver  model;  i.e.,  the  fil¬ 
ter  performs  well  given  only  knowledge  of  the  expected  range  of 
target  maneuver  dynamics.  The  formulation  and  performance  of 
the  tracking  filter  is  described  in  the  following  subsections. 
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TRAJECTORY  AND  SENSOR  MODELS 


The  variables  used  to  describe  the  target  trajectory  and  meas¬ 
urements  are  shown  in  Fig.  2.  The  target  position  is  defined 
by  the  target  altitude  (y)  and  horizontal  distance  from  the 
gun  (x).  The  instantaneous  target  velocity  vector  (v)  is  char¬ 
acterized  by  the  target  airspeed  (V)  and  flight  path  angle 
(y).  The  maneuver  characteristics  of  the  target  are  described 
by  the  flight  path  angle  rate  (6  =  lu).  Measurements  of  two 
polar  components  of  target  position,  range  (r)  and  elevation 
(9),  are  available  to  the  fire  control  system. 


Figure  2  Engagement  Geometry 

The  dynamic  equations  of  the  target  trajectory  and  the  equa¬ 
tions  relating  the  target  state  to  the  tracking  instrument 
measurements  are  listed  in  Table  1.  The  notation,  N(m,v), 
denotes  white  noise  of  mean,  m,  and  variance,  v. 

The  maneuver  characteristics  of  the  target  vehicle  are  embodied 
in  the  flight-path-angle-rate  (w)  equation;  as  discussed  in 
Ref.  1,  the  flight  path  angle  rate  is  directly  related  to  the 
normal  acceleration  of  the  vehicle  which  is  the  usual  command 
variable  for  maneuvering  manned  aircraft  and  tactical  missiles. 
The  parameters  that  govern  the  dynamics  of  flight  path  angle 
rate,  t  and  q,  are  never  precisely  known  in  any  real  engagement 
hence,  the  dynamics  shown  in  Table  1  are  both  stochastic  and 
uncertain . 
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TABLE  1 

SUMMARY  OF  TRAJECTORY  AND  MEASUREMENT  EQUATIONS 


TRAJECTORY  DYNAMICS 


X 

V  cos  y 

Xi 

• 

y 

=: 

V  sin  y 

= 

^2 

• 

V 

= 

-g  sin  y  - 

• 

Y 

=; 

U) 

= 

^4 

# 

1 

U) 

-  —  U)  +  w 

Xrr 

X 

5 

w  ~  N(0,q) 

RANGE  MEASUREMENT 

~  N(0,4  m^) 

ELEVATION  MEASUREMENT 

=  tan"^  (y/x)  +  Vg  ;  Vg  ~  N(0,0.25  mrad^) 


The  thrust/drag  balance  of  the  vehicle  is  quantified  by  the 
parameter  f^;  for  example,  in  an  unpowered  segment  of  a  tac¬ 
tical  missile  trajectory  fj^  relates  the  drag  of  the  vehicle  to 
the  vehicle  airspeed.  The  drag  parameter,  fj^,  is  usually  un¬ 
known  in  a  real  engagement,  but  its  range  of  values  can  be 
estimated;  hence  fj^  is  an  uncertain  but  bounded  parameter. 

The  range  and  elevation  measurements  are  related  to  the  tra¬ 
jectory  states  by  nonlinear  equations.  The  rms  levels  of  sen¬ 
sor  noise  listed  were  taken  from  Ref.  2  and  represent  the  per¬ 
formance  of  state-of-the-art  instrumentation. 

FILTER  FORMULATION 

The  target  trajectory  dynamics  and  measurement  models  listed 
in  Table  1  are  stochastic,  nonlinear,  and  uncertain  (i.e.,  the 
model  parameters  are  not  precisely  known).  Due  to  the  non¬ 
linear  and  uncertain  properties  of  the  trajectory  and  measure¬ 
ment  models  the  standard  linear  optimal  estimator  (Kalman  fil¬ 
ter)  cannot  be  directly  applied;  in  the  present  study,  two 
modifications  of  the  standard  Kalman  filter  structure  were 
employed  to  accommodate  nonlinearity  and  uncertainty: 

•  A  statistically-linearized  formulation 
of  the  Kalman  filter  to  accommodate 
nonlinearities 
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•  A  multiple  model  estimator  structure 
to  accommodate  the  uncertainty  of  the 
model  parameters . 

The  structure  of  a  statistically-linearized  filter  is  shown  in 
Fig.  3.  The  structure  is  substantially  similar  to  the  linear 
Kalman  filter;  it  consists  of  a  state  estimator  that  generates 
estimates  of  the  trajectory  state,  using  a  model  of  the 

dynamics  (x=f(x)),  the  measurement  model  (z=h(x))  and  the 
tracker  measurements,  Zj^,  and  a  error  covariance  (P)  propagator 

The  major  differences  between  the  statistically-linearized 
filter  and  an  extended  Kalman  filter  are  that  expected  values 

of  the  state  derivative  (f(x))  and  measurement  (h(x) )  are  used 
rather  than  direct  functions  of  the  estimates  (i.e.,  jf(x)  and 
h(x))  and  statistical  describing  functions  (Ref.  3)  for  the 
linearized  dynamics  (N)  and  measurements  (M)  are  used  rather 
than  Taylor  series  approximations  (the  mathematical  derivations 
of  the  expected  values  and  describing  functions  are  detailed 
in  Ref.  1).  At  the  expense  of  some  additional  calculations, 
the  statistically-linearized  filter  will  yield  unbiased  esti¬ 
mates  of  the  nonlinear  dynamics  and  avoid  filter  divergence 
problems  typically  encountered  in  applications  of  extended 
Kalman  filters. 

R-668a3 


Figure  3  Statistically-Linearized  Kalman  Filter 
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Uncertainty  of  the  dynamic  model  parameters  is  accommodated 
through  a  multiple-model  estimator  structure.  Figure  A  is  a 
functional  diagram  of  a  two-model  estimator  structure,  as  it 
would  be  applied  to  an  air  defense  tracking  system. '  Here  the 
range  and  elevation  measurements  (z)  are  processed  by  two  fil¬ 
ters.  The  choice  of  the  parameters  and  (correlation 

time  and  process  noise  level  of  the  flight  path  angle  rate 
dynamics)  are  chosen  to  represent  likely  target  maneuver  char¬ 
acteristics  in  the  individual  filters.  For  example,  a  choice 
of  large  and  small  (long  correlation  time,  low  normal 

acceleration  rms)  would  represent  a  level- flight  trajectory  in 
filter  1;  small  T2  and  moderate  Q2  in  filter  2  would  represent 

a  vertical  plane  "jinking"  maneuver.  The  dynamic  models  used 
in  the  two  filters  would  "bracket"  the  expected  range  of  target 
maneuvers  while  the  averaging  algorithm  would  combine  the  esti¬ 
mates  of  the  two  filters  to  tune  the  estimator  structure  to 
the  current  target  maneuver  level.  An  alternative  application 
to  the  dual  filter  structure  for  tracking  an  anti-radar  missile 
having  uncertain  but  bounded  drag  is  described  in  the  following 
section.  Mathematical  details  of  the  dual- filter  structure  in 
air  defense  system  applications  are  covered  in  Ref.  1. 


z 


R--64300 


Figure  4  Multiple  Model  Estimator  Structure 
FILTER  PERFORMANCE 

The  performance  of  the  dual  statistically-linearized  filter  in 
tracking  the  unpowered  midcourse  trajectory  of  anti-radar  mis¬ 
sile  is  described  here.  A  segment  of  the  example  trajectory 
is  shown  in  Fig.  5;  the  segment  shows  a  planar,  unpowered, 
non-maneuvering  gravity- turn  through  the  atmosphere. 

The  parameters  used  in  the  dual  filter  structure  were  chosen 
to  provide  an  adaptation  of  the  estimator  to  the  unknown  drag 
characteristics  of  the  target  vehicle.  The  maneuver  parameters, 
X  and  0^,  are  identical  for  the  two  filters  with  values  appro¬ 
priate  to  a  low  maneuver  level  gravity  turn.  The  drag  param¬ 
eter,  fjj,  is  set  to  zero  in  one  filter  and  to  a  value  2fjj  in 
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Figure  5  Anti-Radar  Missile  Midcourse  Segment 

the  other  filter,  where  is  an  estimate  of  the  vehicle  drag 

parameter  derived  from  intelligence  data  (Ref.  4).  The  two 
values  of  f^^  "bracket"  the  range  of  variation  of  fj^;  the  dual¬ 
filter  averaging  algorithm  then  combines  the  estimates  of  the 
two  filters  to  "tune"  to  the  current  value  of  fj^. 

The  performance  of  the  dual-filter  in  tracking  the  example 
trajectory  is  shown  in  Fig.  6.  Unbiased  errors  in  range  and 
elevation  estimates  are  obtained.  Substantial  improvement  of 
rms  range  and  elevation  error  over  the  measurement  accuracies 
is  achieved;  range  error  is  reduced  to  1  meter  rms  as  opposed 
to  2  meters  rms  measurement  error  and  elevation  is  reduced  to 
0.25  mrad  rms  as  opposed  to  0.5  mrad  rms  measurement  error. 

There  is  good  agreement  between  the  true  estimation  error  vari¬ 
ance  and  the  error  variance  computed  by  the  dual  filter,  thereby 
validating  the  use  of  statistical  linearization,  the  choice  of 
parameters  in  the  present  application,  and  the  viability  of  a 
dual  filter  structure  in  adapting  to  unknown  drag  characteris¬ 
tics  of  the  target. 


STABILIZATION  SYSTEM 

An  instrument  platform  stabilization  system  design  based  on  a 
Kalman  filter/optimal  feedforward- feedback  control  structure 
is  described  in  this  section.  Using  this  optimal  structure. 
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RANGE  ERROR  (m) 


RANGE  ERROR  HISTORY 


ELEVATION  ERROR  HISTORY 


6  ,  Anti-Radar  Missile  Tracking  Performance 

point  designs  for  various  levels  of  terrain- induced  vibration 
were  performed  and  analyzed.  Suboptimal  systems  were  then 
designed  and  their  performance  compared  with  that  of  the  opti¬ 
mal  designs;  by  this  means,  major  simplifications  of  the  system 
structure  were  shown  to  be  possible  with  little  degradation  of 

The  most  significant  of  these  sim- 
P^^f^^^tions  is  that  the  stabilization  system  can  be  decentral¬ 
ized  from  the  tracking  system. 

SYSTEM  DESIGN 


The  structure  of  the  stabilization  system  is  shown  in  Fig.  7. 
Using  the  platform  rate  integrating  gyro  and  hull  mounted  rate 
gyro  measurements,  a  full-order  Kalman  filter  (i.e.,  a  filter 
designed  using  the  coupled  dynamics  of  the  platform,  rate  in¬ 
tegrating  gyro,  and  terrain- induced  disturbance)  generates 

estimates  of  the  gyro  output  angle  (6g),  the  platform  eleva¬ 
tion  angle  and  elevation  rate  (0^  and  and  the  disturbance 

state  (aj^).  The  estimates  are  then  multiplied  by  a  control 


gain  matrix,  ,  to  compute  the  platform  torquer  and  gyro  com¬ 
mands  (u^  and  u  ) . 

e  g' 


The  optimal  control  gain  matrix,  K^ ,  is  partitioned  into  gains 

that  multiply  the  gyro  and  platform  states  and  gains  that  mul¬ 
tiply  the  disturbance  states,  i.e.,: 
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Figure  7  Structure  of  Instrument  Platform 
Stabilization  System 

With  this  partitioning,  the  control  system  can  be  described  in 
terms  of  feedback  and  feedforward  control  functions.  Desirable 
platform  dynamic  response  is  achieved  by  proper  values  of  the  K 
gains;  optimum  disturbance  rejection  is  accomplished  by  the  ^ 
gains . 

The  terrain  disturbance  model  used  in  the  present  study  was 
derived  from  data  presented  in  Ref.  5  and  is  described  in 
Ref.  1.  This  model  represents  travel  over  road  and  country 
terrain  at  speeds  from  5  to  25  km/hr. 

Th  noise  characteristics  of  the  rate  integrating  gyro  and  the 
hull  rate  gyro  are  given  in  Table  2.  These  noise  characteris¬ 
tics  are  typical  of  tactical  aircraft  quality  instruments. 

PERFORMANCE  AND  SENSITIVITY  ANALYSIS 

The  method  used  to  analyze  the  stabilization  system  performance 
was  steady-state  covariance  analysis.  The  covariance  matrix 
of  the  stabilization  system  is  derived  from  the  steady-state 
covariance  equation: 
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TABLE  2 

MEASUREMENT  NOISE  LEVELS 


INSTRUMENT 

RMS  NOISE 

BANDWIDTH 

RIG 

0.31  prad 

600  Hz 

Hull  Rate  Gyro 

2 . 9  mrad/sec 

600  Hz 

FX  +  XF^  +  Q  =  0 


(1) 


Here  X  is  the  covariance  matrix  of  the  system  which  consists 
of  subsets  related  to  the  covariances  of  the  platform  states, 
the  disturbance^states  and  the  state  estimator  errors.  The 
matrices  F  and  Q  are  the  dynamics  and  process  noise  matrices, 
respectively,  for  the  coupled  platform,  disturbance,  controller 
and  state  estimator  error  dynamics. 

The  performance  metric  chosen  in  the  present  case  is  the  steady 
state  variance  of  the  platform  elevation  angle;  performance 
sensitivity  analyses  yielded  the  following  significant  results: 

•  Stabilization  of  the  instrument  plat¬ 
form  to  3.3  |jrad  rms  over  the  entire 
range  of  terrain  and  vehicle  speed 
using  fixed  gains  (computed  for  worst 
case  design  point) 

•  Elimination  of  the  feedforward  branch 
of  the  controller  providing  a  major 
simplification  of  control  structure 
with  negligible  degradation  of 
performance 

•  Stabilization  error  rms  increases  by 
a  factor  of  4  with  hull  rate  gyro 
removed 

•  Platform  residual  stabilization  error 
is  sufficiently  low  (i.e.,  two  orders 
of  magnitude  lower  than  the  tracking 
elevation  sensor)  that  the  stabiliza¬ 
tion  and  tracking  functions  can  be 
decentralized. 


GUN  POINTING 

The  pointing  system  design  is  based  on  a  recently-developed 
type  of  feedforward  controller,  the  command  generator  tracker 
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(CGT) .  This  controller  causes  the  plant  output  (gun  pointing 
angle)  to  track  exactly  the  output  of  a  specified  deterministic 
linear  system;  in  tracking  stochastic  processes  (such  as  a 
maneuvering  target)  the  CGT  has  shown  performance  superior  to 
that  of  optimal  feedforward  controllers. 

SYSTEM  DESIGN 

The  structure  of  the  predictor/gun  pointing  system  developed 
in  this  chapter  is  shown  in  Fig.  8.  The  model  of  the  gun  ele¬ 
vation  dynamics  is  comprised  of  a  gain,  to  represent  the 

hydraulic  motor  command  response  and  inertia;  the  dynamics  of 
the  hydraulic  motor  valve  are  very  high  frequency  and  were 
therefore  neglected  in  the  present  design.  The  rotation  rate 
of  the  hydraulic  motor  is  geared-down  by  a  factor  of  1/n  in 
the  gear  box;  the  gun  elevation,  is  the  integral  of  the 

gun  rotation  rate. 


Figure  8  Predictor/Gun  Pointing  System  Structure 

Elevation  and  elevation-rate  estimates  from  the  tracking  filter 
are  extrapolated  forward  in  time  through  a  computed  projectile 

time-of-flight.  The  extrapolated  elevation  rate  (^^(t+t^))  is 

multiplied  by  a  feedforward  control  gain  (n/Ky)  to  drive  the 

gun  elevation  rate  to  match  the  target  elevation  rate.  The 

error  between  the  extrapolated  target  elevation  (6^(t+t£))  and 

the  gun  elevation  is  driven  to  zero  by  a  regulator  gain,  K^; 

the  magnitude  of  this  gain  determines  the  control  bandwidth  of 
the  closed-loop  gun  dynamics. 

PREDICTOR/POINTING  SYSTEM  ERROR  ANALYSIS 

The  mathematical  relationship  between  gun  pointing  error  and 
the  trajectory  state  prediction  errors  is  summarized  in  this 
section.  First,  the  predictor  elevation  and  elevation  rate 
errors  are  derived  in  terms  of  the  errors  in  the  extrapolated 
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trajectory  state.  The  gun  pointing  error  is  then  derived  from 
the  predictor  errors  and  the  dynamics  of  the  gun/controller 
system.  Figure  9  shows  the  geometrical  relationships  used  to 
derive  the  elevation  and  elevation  rate  errors. 


R-67539 


Figure  9  Geometrical  Relationships  for  Prediction 
Error  Analysis 

After  analyzing  the  differential  geometry  of  Fig.  9  (Ref.  1) 
one  obtains  the  following  expressions  for  the  predictor  errors: 


^  ^  ^  ^ 
e^  =  9  [e^/V  -  ej^/R  +  cot(0-Y)  (e^-e^ )  ] 


(3) 


where  the  notation  e^  denotes  the  prediction  error  for  variable 

a,  and  B  denotes  the  predicted  value  of  variable  B.  Equations 
2  and  3  relate  the  elevation  and  elevation  rate  errors  (e^  and 


ex),  which  corrupt  the  control  commands  to  the  gun,  to  the 

u  ^ 

predicted  trajectory  states  (x,  y,  V,  y)  and  predictor  errors 


V 


e^  ,  and  e^^) 


By  expressing  the  true  target  elevation  and  elevation  rate 
(which,  if  commanded  to  the  gun  pointing  system  would  yield 
exact  target  tracking)  in  terms  of  their  predicted  values  and 
prediction  errors;  i.e.. 


0?  ®T  ®0 


(A) 
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(5) 


# 

e 


T 


the  following  equation  is  derived  (Ref.  1)  for  the  gun  pointing 
errors : 


e 

gun 


+ 


(6) 


where  e  is  the  gun  pointing  error  and  t  (-n/(Kjj^K^) )  is  the 

time-constant  of  the  closed-loop  gun  regulator  dynamics.  Equa¬ 
tion  6  defines  the  dynamics  of  the  gun  pointing  error  in  re¬ 
sponse  to  errors  of  the  extrapolated  elevation  and  elevation 
rate.  As  indicated  by  Eq.  6,  the  gun  pointing  errors  are  fre¬ 
quency  limited  by  the  bandwidth  (l/t)  of  the  gun  regulator. 

The  static  gain  of  elevation  error  to  gun  error  is  unity;  the 
static  gain  of  elevation  rate  error  to  gun  error  is  x . 

Equations  2,  3  and  6  could  be  used  in  a  Monte-Carlo  simulation 
to  determine  the  time-history  and  statistics  of  the  gun  point¬ 
ing  error  in  a  particular  air-defense  scenario.  The  predictor 
errors  would  be  computed  as  the  difference  between  the  predicted 
elevation  and  elevation  rate  and  the  elevation  and  elevation 
rate  of  the  truth  model  trajectory.  A  linear  combination  of 
the  computed  predictor  errors  (i.e.,  (l/t)  eg  +  e^ )  would  be 

the  input  to  a  first-order  linear  filter  with  time  constant,  x; 
the  output  of  the  filter  would  be  the  gun  pointing  error. 


CONCLUSIONS 

In  this  paper,  modern  estimation  and  control  design  techniques 
were  applied  to  the  GLAADS  air  defense  system.  Specific  con¬ 
cepts  considered  in  this  study  include:  application  of  statis¬ 
tical  linearization  to  the  nonlinear  trajectory  dynamics  and 
tracking  sensor  measurements,  multiple-model  estimator  struc¬ 
turing  of  the  tracking  filter  to  accommodate  modeling  uncer¬ 
tainty,  stabilization  system  design  by  optimal  estimation  and 
control  techniques,  optimal  prediction  algorithm  design,  and 
command-generator- tracker/optimal  regulator  control  of  gun 
pointing.  The  performance  benefits  obtained  through  applica¬ 
tions  of  these  concepts  were  demonstrated  through  subsystem 
examples . 

Further  developments  of  these  concepts  should  include: 

•  Extension  and  evaluation  of  the  design 
formulations  for  non-planar  engagement 
scenarios 
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•  Development  of  practical  implementa¬ 
tion  structures  for  eventual  field 
testing 

•  Global  performance  evaluation  of  a 
mobile  air  defense  system  (i.e.,  error 
budget  structure)  to  identify  high- 
payoff  areas  for  subsystem  improvement 
and  development. 


REFERENCES 

1.  Glasson,  D.P. ,  and  Acharya,  N.A. ,  "Gun/Sensor  Stabiliza¬ 
tion  and  Control  Technology,"  Report  No.  ARSCD-CR- 81028 , 
Fire  Control  and  Small  Caliber  Weapon  Systems  Laboratory, 
U.S.  Army  ARRADCOM,  Dover,  New  Jersey,  July  1981. 

2.  Huling,  S.,  Mintz ,  M. ,  Goodman,  S.,  and  Dziwak,  W. , 

"Robust  Autoregressive  Models  for  Predicting  Aircraft 
Motion  from  Noisy  Data,"  Proceedings  of  IEEE  Conference 
on  Decision  and  Control,  Ft.  Lauderdale,  FL,  1979. 

3.  Gelb,  A.,  (Ed.),  Applied  Optimal  Estimation,  M.I.T. 

Press,  Cambridge,  1974. 

4.  Information  provided  by  Foreign  Intelligence  Office, 

U.S.  Army  ARRADCOM,  Dover,  New  Jersey  (SECRET). 

5.  Goldgraben,  J.R.,  and  Buchanan,  B.,  "Hull  Motion  Studies," 
Vol .  1  and  2,  Report  No.  DAAA25 -7 6-A- 0441/0001 ,  Decilog, 
Inc.,  Melville,  NY,  January  1979. 

6.  Thodos,  et.  al.,  "Gun  Low  Altitude  Air  Defense  System," 
Technical  and  Management  Summary,  Report  No.  RIA-R-TR- 
76-006,  Aircraft  and  Air  Defense  Weapon  Systems  Direc¬ 
torate,  Gen.  Thomas  J.  Rodman  Laboratory,  Rock  Island, 

IL,  June  1976. 


93 


Next  page  is  blank. 


I 


ON  DESIGNING  ROBUST  PREDICTORS  USING  FINITE  STATE 

MARKOV  CHAINS 


E.  B.  Pate  &  M.  Mintz 
Dept,  of  Systems  Eng. 
Univ.  of  Pennsylvania 
Phila.,  PA  19104 


W.  Dziwak  St  S.  A.  Goodman 
US  Army  Armament  Research 
and  Development  Command 
Dover,  NJ  07801 


ABSTRACT 

Traditionally,  control  systems  oriented  studies  of  uncertain 
dynamic  processes  with  continuous  state  descriptions  have 
relied  primarily  on  the  identification  and  estimation  of 
models  based  on  ordinary  differential  equations,  partial 
differential  equations,  or  related  difference  equations. 
Recently,  the  authors  have  explored  the  use  of  finite  markov 
chains  to  approximate  processes  with  continuous  state 
descriptions.  Our  applications  of  these  techniques  have 
focused  on  the  determination  of  robust  models  of  evasive 
targets  in  the  context  of  AAA  fire  control. 

We  have  developed  a  practical  technique  for  estimating 
robust  finite  state  markov  chain  models.  This  technique  is 
based  on  game  theory  and  provides  a  minimax  method  for 
determining  robust  approximations  of  markov  chain  state 
transition  matrices,  which  are  used  to  construct  target 
motion  predictors.  The  importance  of  this  new  approach  lies 
in  its  ability  to  provide  easily  implemented  predictors 
which  can  outperform  traditional  autoregressive  models  in  a 
noisy  environment. 

This  paper  outlines  this  new  methodology  and  provides 
examples  of  fire  control  applications  based  on  flight  test 
data . 


INTRODUCTION 

The  present  study  is  an  outgrowth  of  an  earlier  study  (1) 
which  characterized,  identified,  and  validated  robust 
mathematical  models  for  the  motion  of  an  attack  aircraft 
during  its  weapon  delivery  pass  against  a  defended  target. 
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These  new  msineuver  models  provided  the  basis  for  enhanced 
filtering  and  prediction  algorithms  for  AAA  fire  control 
systems.  The  development  of  these  models  and  algorithms  was 
based  on  a  synthesis  of  univariate  time  series  methods  and 
game  theoretic  analysis.  This  synthesis  lead  to  (i)  the 
development  and  validation  of  a  practical  design  procedure 
for  high  performance  target  state  estimators  in  the  presence 
of  moderate  to  large  parameter  uncertainty,  and  (ii)  a 
technique  for  designing  a  class  of  "worst  case'  maneuver 
processes  to  blunt  the  effectiveness  of  AAA  systems. 

A  central  aspect  of  the  research  reported  in  (1)  was  the  use 
of  authentic  flight  test  data,  which  consisted  of  eleven 
sample  flight  profiles  an  aircraft  might  perform  while 
attacking  a  defended  ground  target.  The  actual  data  was 
gathered  during  flight  tests  with  an  A7-E  aircraft  at  the 
NWTC,  China  Lake,  California.  These  attack  profiles,  which 
also  constitute  the  flight  test  data  base  for  this  present 
study,  are  described  in  detail  in  Chapter  II  of  reference^ 

(1) .  This  kinematic  data  base  describes  the  aircraft  motion 
in  a  cartesian  coordinate  system,  where  the  origin  of  this 
coordinate  system  is  the  aircraft's  intended  target,  as  well 
as  the  assumed  location  of  the  AAA  weapon  system.  The 
kinematic  data  describing  the  eleven  flight  profiles  in  the 
XYZ  coordinate  system  includes  consistent  position, 
velocity,  acceleration,  and  acceleration-dot  data  in  each 
coordinate  with  a  time  increment  of  0.1  sec.  The  primary^ 
models  developed  in  (1)  characterize  the  aircraft  motion  in 
terms  of  "decoupled"  autoregressive  (AR)  models  for  the 
individual  acceleration-dot  time  series  in  X,  Y,  and  Z.  We 
summarize  the  salient  results  of  this  earlier  study  with  the 
following  remarks; 

(i)  Although  the  eleven  flight  paths  appear  significantly 
different  to  the  "naked  eye,"  the  thirty-three 
acceleration-dot  time  series  in  the  data  base  —  eleven 
flight  paths  times  three  directions  —  are  shown  to  be 
accurately  modeled  by  a  single  robust  fifth— order 
autoregressive  model.  The  eleven  flight  paths  in  this  data 
base  include  three  dive  toss  maneuvers,  five  dive  maneuvers, 
and  three  pop-up  maneuvers.  The  acceleration-dot  processes 
were  incorporated  in  the  model  development  since  the 
acceleration,  velocity,  and  position  time  series  are  all 
significantly  nonstationary. 

(ii)  Substantial  improvements  in  overall  prediction 
capability  are  achievable  by  using  robust,  high— order 
filter-predictor  algorithms  based  on  a  fifth-order  AR  model 
of  acceleration— dot  instead  of  the  "usual  (benchmark) 
third-order  algorithms  based  on  a  first-order  AR  model  of 
acceleration . 

(iii)  Typical  improvements  in  average  hit  probability 
achieved  by  the  new  models  developed  in  (1)  versus  the 
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standard  benchmark  model  based  on  a  first-order  AR  model  of 
acceleration,  ranged  from  25  to  35  per  cent.  The  specific 
enhancement  in  average  hit  probability  associated  with  these 
new  filter-predictor  algorithms  depended  on  the  specific 
flight  path,  and  the  noise  levels  in  the  unfiltered 
observations.  The  unfiltered  observations  were  modeled  by 
target  range,  azimuth,  elevation,  and  the  respective  rates. 


ALTERNATIVE  MODELS 
Motivation 


The  consideration  of  dynamic  stochastic  models  for  target 
motion  based  on  alternatives  to  the  AR  models  described  in 
this  Introduction  and  detailed  in  (1)  is  motivated  by  the 
observation  that  predictors,  designed  on  the  basis  of 
nominal  fifth-order  AR  models  for  acceleration-dot,  showed 
extreme  sensitivity  to  observation  noise.  Here,  a  nominal 
AR  model  refers  to  a  model  obtained  by  a  standard  least 
squares  identification  procedure  in  conjunction  with  the 
usual  goodness  of  fit  tests.  This  extreme  sensitivity  to 
observation  noise  was  eliminated  in  (1)  by  the  following 
ad  hoc  approach;  the  nominal  fifth-order  AR  model  used  in 
the  predictor  was  replaced  by  a  modified  fifth-order  model 
with  greater  damping  and  reduced  bandwidth.  The  overall 
filter/predictor  system  was  characterized  by  three  decoupled 
channels  for  the  X,  Y,  and  Z  coordinates.  The  individual 
Kalman  filters  were  implemented  based  on  the  nominal  values 
of  the  X  triple-dot  AR  coefficients  for  Pass  10.  The 
performance  of  this  simplified  filter  structure  with  these 
specific  coefficients  was  studied  extensively  through 
simulation  experiments.  The  results  of  this  study  indicated 
that  this  simplifed  filter  provided  good  performance  against 
all  of  the  observed  flight  profiles.  A  worst  case  analysis 
based  on  game  theoretic  techniques  revealed  that  this 
simplified  nominal  filter  should  be  replaced  by  a 
structurally  identical  filter  with  different  AR  parameter 
values  to  counter  plausable  worst  case  maneuver  processes 
which  were  not  part  of  the  flight  test  data  base.  We  refer 
the  reader  to  (1)  for  a  complete  description  of  this  worst 
case  analysis  and  filter  design  technique.  The  coefficients 
for  the  individual  X,  Y,  and  Z  coordinate  predictors  were 
obtained  by  adjusting  the  overall  bandwidth  and  damping  to 
achieve  good  prediction  performance  based  on  the  filtered 
observations.  A  common  choice  of  modified  AR  coefficients 
was  implemented  for  each  of  the  three  predictors.  This 
adjustment  of  predictor  bandwidth  and  damping  necessitated 
some  empirical  analysis  via  simulation  studies.  It  is  worth 
emphasizing  that  these  "decoupled"  eighth-order 
nominal  filter/modified  predictors  provided  25  -  35  per  cent 
improvement  in  average  hit  probability  compared  with  the 
usual  "decoupled"  third-order  benchmark  algorithm.  We  note 
that  although  the  nominal  predictors  based  on  fifth-order  AR 
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models  of  acceleration-dot  showed  extreme  sensitivity  to 
observation  noise r  which  necessitated  modifying  the 
predictor ^s  frequency  response r  a  nominal  predictor  based  on 
the  Pass  10  X  triple-dot  AR  coefficients  provided  excellent 
prediction  capability  against  each  of  the  eleven  flight 
paths  in  a  noiseless  environment.  This  substantial 
sensitivity  to  observation  noise  is  of  more  than  academic 
interest.  It  suggests  that  the  underlying  process  (truth 
model)  which  actually  characterizes  the  aircraft  motion  may 
exhibit  structural  details  which  are  not  embraced  by  AR  or 
more  generally  by  autoregressive  integrated  moving  average 
(ARIMA)  approximations.  This  last  statement  is  not  intended 
to  imply  that  ARIMA  models  are  unsuited  for  maneuver  process 
modeling,  but  rather  that,  in  the  set  of  all  possible 
maneuver  process  models,  there  may  be  alternative  classes  of 
models  which  provide  better  explainations  of  the  underlying 
process.  One  recognizes,  in  applications  oriented  model 
building,  that  there  is  a  delicate  balance  which  must  be 
sought  between  process  explaination  (complexity)  and 
implementation  approximations  (simplicity) .  The  choice  of 
pivot  point  for  this  balance  is  strongly  influenced  by  the 
model^’s  intended  application. 

Models  Based  on  Aerodynamic  Variables  and  Aircraft  Aspect 

We  began  our  consideration  of  alternative  models  for  evasive 
attack  aircraft  motion  by  examining  an  overall  model 
structure  which  was  delineated  in  terms  of  aerodynamic 
variables  and  aircraft  aspect.  This  new  model  was  based  on 
an  integration  of  finite  markov  chain  models  for  aircraft 
normal  acceleration  with  ARIMA  models  for  aircraft 
tangential  acceleration  and  bank  angle.  The  consideration 
of  dynamic  stochastic  models  for  target  motion  based  on 
target  aspect  (bank  angle)  as  well  as  aerodynamic  variables 
(normal  and  tangential  acceleration)  was  motivated  by  an 
earlier  investigation  reported  in  (2) .  These  earlier 
results  indicated  that  enhanced  prediction  capability  might 
be  achievable  based  on  prediction  algorithms  defined  in 
terms  of  target  aspect,  airspeed,  and  normal  acceleration, 
particularly  over  extended  prediction  intervals  (e.g.  3-5 

sec.).  We  remark  that  other  investigators  working  in  the 
air-to-air  fire  control  environment  have  recently  considered 
state  estimation  algorithms  based  on  target  aspect  and 
normal  acceleration.  However,  these  collateral  works,  which 
are  reported  in  (3  —  5) ,  do  not  make  use  of  any  flight  test 
data. 

The  factors  which  suggest  modeling  target  motion  in  terms  of 
bank  angle,  normal  acceleration,  and  tangential  acceleration 
are : 

(i)  A  desire  to  describe  target  motion  in  terms  of  decision 
variables  under  the  control  of  the  pilot. 


98 


(ii)  The  recognition,  based  on  theoretical  considerations  as 
well  as  empirical  studies,  that  the  stochastic  dynamic 
behavior  of  the  individual  X,  Y,  &  Z  acceleration-dot  time 
series  for  a  given  flight  profile  are  strongly  coupled  in  a 
noncausal  fashion. 

(iii)  The  recognition  that  alternative  models  for  target 
motion  based  on  aspect  and  aerodynamic  variables  could  allow 
the  exploitation  of  partially  redundant  dynamic  data  in  the 
context  of  seeking  enhanced  prediction  capability  through 
multisensor  integration. 

Preliminary  Results 

(i)  The  first  phase  of  the  present  study  focused  on  the 
identification  and  estimation  of  univariate  time  series 
models  for  target  bank  angle  (BA) ,  normal  acceleration  (NA) , 
and  tangential  acceleration  (TA) .  These  results  indicate 
that  the  eleven  BA,  NA,  &  TA  time  series  can  be  adequately 
modeled  by  three  separate  ARIMA  models.  By  the  phrase 
"adequately  modeled,"  we  mean  that  based  on  noiseless  data 
there  is  inadequate  evidence  to  support  the  rejection  of  the 
null  hypothesis.  The  phrase  "adequately  modeled"  is  not 
intended  to  imply  that  these  models  necessarily  have  good 
sensitivity  properties.  The  prediction  capabilities  of 
these  models  in  a  noisy  environment  will  be  described 
subsequently . 

(ii)  A  single  input  single  output  (SISO)  transfer  function 
analysis  indicates  that  while  there  are  weak  causal 
relations  between  BA  and  NA,  and  between  NA  and  TA,  it  is 
adequate  to  treat  the  individual  BA,  NA,  &  TA  time  series 
for  a  given  flight  path  as  independent  series. 

(iii)  The  NA  time  series  for  each  flight  path  exhibits 
significant  piecewise  linear  behavior.  This  suggests  that 
the  rate  of  change  of  normal  acceleration  can  be  modeled 
approximately  as  a  finite  state  markov  chain.  Detailed 
analysis  indicates  that  this  finite  state  markov  chain  model 
is  quite  competitive  with  the  previously  described  ARIMA 
model  for  NA  as  judged  by  the  relative  prediction  capability 
of  each  model  in  a  noiseless  environment. 

(iv)  A  comparative  analysis  of  predictor  performance  for  the 
case  of  noiseless  data  indicates  that: 

(a)  The  hybrid  predictor  (based  on  a  nominal  markov 
chain  model  for  NA-dot,  and  nominal  ARIMA  models  for  TA  & 

BA)  performs  comparably  to  the  predictor  comprised  of 
nominal  ARIMA  models  for  NA,  TA,  &  BA. 

(b)  The  performance  of  both  of  these  new  predictors  is 
comparable  to  that  of  the  robust  predictor  based  on 
acceleration-dot  obtained  in  (1) . 

(c)  The  nominal  fifth-order  AR  models  based  on 
acceleration-dot  obtained  in  (1)  performs  noticeably  better 
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than  each  of  the  other  predictors  cited  in  (iv-a)  and 
(iv-b) .  The  high  performance  capability  of  these  nominal 
fifth-order  AR  models  in  a  noiseless  environment  is  not 
practically  useful  because  of  the  severe  noise  sensitivity 
of  these  models  as  noted  previously. 

(v)  A  comparative  analysis  of  predictor  performance  for  the 
case  of  noisy  data  indicates  that: 

(a)  The  nominal  ARIMA  models  of  NAf  TAf  &  BA  exhibit 

the  same  extreme  sensitivity  to  observation  noise  as  was 
found  for  the  nominal  fifth-order  AR  models  of 
acceleration— dot  obtained  in  (1)  •  ^  ^ 

(b)  The  nominal  finite  state  markov  chain  model  for 
NA-dot  showed  substantial  noise  immunii^. 


Preliminary  Conclusions 

We  draw  the  following  conclusions  from  these  preliminary 
results: 

(i)  The  nominal  finite  state  markov  chain  model  has 
substantially  better  noise  immunity  than  its  nominal  ARIMA 
counterpart . 

(ii)  Logical  directions  for  further  research  include: 

(a)  A  study  of  the  behavior  of  finite  state  markov 
chain  approximations  to  the  TA  &  BA  processes. 

(b)  A  study  of  the  behavior  of  finite  state  markov 
chain  approximations  to  the  X,  Y,  &  Z  acceleration-dot 
processes  resolved  in  a  cartesian  coordinate  system. 

We  decided  to  direct  our  phase-two  efforts  towards  a 
detailed  study  of  finite  state  markov  chain  approximations 
of  the  acceleration-dot  processes  resolved  in  a  cartesian 
coordinate  system.  We  chose  this  direction  over  the  finite 
state  markov  chain  study  of  NA,  TA,  &  BA,  since  current 
sensor  capabilities  do  not  suggest  the  immediate 
availability  of  direct  or  indirect  aircraft  bank  angle  data 
with  a  suitable  S/N  ratio. 


MARKOV  CHAIN  MODELS  of  A-DOT 

In  this  section  we  describe  the  determination  and  validation 
of  nominal  finite  state  markov  chain  approximations  to  the 
individual  X,  Y,  &  Z  acceleration-dot  processes  for  the 
flight  paths  in  our  data  base.  The  following  approach  is 
justified  if  we  assume  that  the  underlying  A-dot  data  is 
generated  by  a  (possibly)  infinite  state  wide-sense 
stationary  markov  process  with  sample  paths  that  exhibit 
piecewise  constant  behavior  over  intervals  of  random 
duration.  We  refer  the  reader  to  reference  (6)  for  a  more 
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rigorous  probabilistic  description  of  the  class  of 
underlying  processes  for  which  our  approximation  algorithm 
is  theoretically  valid.  In  the  following  discussion  we 
focus  our  attention  on  a  generic  sample  path,  for  example, 
the  X  triple-dot  realization  for  Pass  1. 


An  Approximation  Algorithm 


Our  approximation  algorithm  has  the  following  steps: 


Step  1  -  Dynamic  Range;  Determine  the  dynamic  range  (maximum 
&  minimum  values)  of  the  time  series  data. 


Step  2  -  State  Assignment;  Determine  a  finite  set  of 
"states"  for  the  time  series  data. 


Step  3  -  Transition  Matrix  Estimation;  Estimate  the 
probabilities  in  the  one-step  state  transition  matrix  — 
based  on  the  state  assignments  made  in  Step  2. 

Step  4  -  Repeat  Steps  1-3  for  the  remaining  time  series 
data  sets. 


Step  5  -  Hypothesis  Testing;  Perform  the  appropriate 
statistical  tests  to  evaluate  the  validity  of  the  hypothesis 
that  the  derived  finite  state  markov  chains  have  a  common 
underlying  distribution.  (We  refer  the  reader  to  reference 
(7)  for  an  excellent  treatment  of  hypothesis  tests  for 
finite  state  markov  chains.  ) 


Some  Comments  on  the  Algorithm 

Step  2;  There  is  no  canonical  method  to  resolve  the  state 
assignment  question.  We  have  explored  two  methods  in  this 
preliminary  analysis.  These  methods  are;  (a)  the  method  of 
uniform  dynamic  range,  and  (b)  the  method  of  uniform 
duration.  The  method  of  uniform  dynamic  range  breaks  the 
dynamic  range  of  the  time  series  into  M  equal  intervals. 

(In  this  analysis  we  chose  M  =  5.  )  The  center  point  of 
each  interval  was  defined  to  be  the  value  of  the  associated 
state.  The  method  of  uniform  duration  selects  the  M  "state 
intervals"  such  that  the  number  of  "occupancies"  in  each 
interval  are  identical  (or  approximately  identical) .  Here 
again,  the  value  of  M  was  chosen  to  equal  5,  and  the  center 
point  of  each  interval  was  used  to  designate  the  state 
value.  The  selection  of  M  =  5,  as  well  as  the  state 
assignment  procedure  represents  judgmental  calls  on  the  part 
of  the  investigators.  Further  research  on  these  issues  is 
currently  being  pursued. 

Step  3;  The  underlying  one-step  state  transition  matrices 
were  estimated  using  the  method  of  maximum  likelihood  (7) . 


101 


structural  and  Statistical  Results 

Although  the  flight  paths  all  look  very  different  to  the 
"naked  eye,"  the  estimated  one-step  state  transition 
matrices  shov/ed  remarkable  similarity  between  the  X,  Y,  &  Z 
directions  for  a  given  flight  path,  and  between  all  of  the 
flight  paths.  This  result  corresponds  to  the  related 
behavior  of  the  acceleration-dot  spectra  reported  in  (1) . 
Use  of  the  uniform  dynamic  range  rule  for  state  assignment 
led  to  one-step  transition  matrices  which  exhibit  a 
pronounced  birth-death  behavior. 

Prediction  Using  Markov  Chains 

Prediction  based  on  a  MMSE  criterion  is  easily  carried  out 
for  a  finite  state  markov  chain  model.  The  propagation  of 
the  conditional  expectation  of  the  state  n  steps  into  the 
future  is  obtained  by  raising  the  one-step  state  transition 
matrix  to  the  nth  power  and  then  multiplying  the  resulting 
matrix  by  a  column  vector  of  state  values.  The  desired 
value  of  the  conditional  expectation  is  merely  a  weighted 
sum  of  n-step  state  transition  matrix  entries. 

Numerical  Results 

In  this  section  we  present  the  results  of  a  series  of 
simulation  experiments  which  were  carried  out  to  test  the 
validity  of  this  finite  state  markov  chain  approximation 
concept.  These  results  represent  two  separate  classes  of 
finite  state  markov  chain  approximations.  The  first  model 
denotes  a  finite  state  markov  chain  approximation  to  X,  Y,  & 
Z  triple-dot.  The  predictions  are  based  on  a  "multiple 
integration"  of  the  predicted  values  of  acceleration-dot, 
acceleration,  and  velocity  in  each  coordinate.  A  standard 
Taylor  series  model  defines  the  overall  predictor  structure. 
The  second  model,  which  we  introduce  very  briefly,  denotes  a 
finite  state  markov  chain  approximation  to  the  individual 
components  of  the  derivative  of  the  aircraft's  angular 
velocity  vector  (Omega- dot) .  Detailed  motivation  for  this 
choice  of  state  variable  description  of  the  aircraft  motion 
appears  in  (6) .  Our  purpose  in  presenting  these  additional 
results  here  is  to  illustrate  that  the  choice  of  state 
variable  description  or  model  coordinatization  is  not 
uniquely  or  canonically  determined.  Both  classes  of  models 
have  been  exercised  against  the  benchmark  predictor.  Since 
sensitivity  to  observation  noise  is  a  critical  issue, 
predictor  performance  with  and  without  noise  has  been 
investigated.  The  signal  to  noise  ratios  used  in  these 
experiments  correspond  roughly  to  the  low  level  noise  figure 
experiments  performed  in  (1).  In  the  current  experiments, 
uncorrelated  observation  noise  was  added  to  the  data  prior 
to  the  prediction  calculations.  No  filtering  was  carried 
out  on  these  data  prior  to  the  prediction  calculations. 

Since  Kalman  filters  were  employed  in  the  experiments 
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reported  in  (1) ,  a  complete  correspondence  between  these 
preliminary  experiments  and  the  results  reported  in  (1) , 
cannot  be  made.  The  results  of  the  present  experiments 
should  be  viewed  as  preliminaryr  since  the  markov  chain 
predictors  which  were  implemented  in  each  case  were  based 
solely  on  the  Pass  1  time  series  data.  This  choice 
represents  a  first  cut,  and  illustrates  the  excellent 
performance  of  the  markov  chain  approximations,  which  did 
not  require  any  tuning.  The  results  of  the  prediction 
experiments  are  contained  in  Table  1.  We  remark  that  both 
classes  of  markov  chain  models  performed  well  against  the 
benchmark  model,  and  both  exhibited  significant  noise 
immunity.  The  performance  of  the  nominal  (non  robust) 
fifth-order  AR  models  for  acceleration-dot  yield 
corresponding  average  hit  probabilities  significantly  less 
than  0.1  at  this  noise  figure.  Further  results  relating  to 
the  robustization  of  these  preliminary  markov  chain 
approximations  are  reported  in  (6,  &  8) .  Reference  (6) 
includes  a  worst  case  analysis  of  the  markov  chain 
approximation  process,  as  well  as  the ^ saddle— point  theory 
which  justifies  the  worst  case  analysis. 


CONCLUSIONS 

This  study  has  presented  substantial  evidence  to  support  the 
assertion  that  finite  state  markov  chain  approximations  can 
be  used  in  place  of  AR  (and  ARIMA)  models  as  the  basis  for 
predictors  in  a  realistic  AAA  fire  control  application.  The 
value  of  the  markov  chain  approximations  lie  in  their 
substantial  immunity  to  observation  noise.  It  has  been 
shown  previously  in  (1)  that  high  order  AR  models  can 
pi^ovide  a  substantial  enhancement  in  prediction  performance 
in  comparison  with  the  usual  decoupled  third-order  benchmark 
model.  However,  the  application  of  these  high  order  AR 
models  requires  that  the  nominal  parameter  values  be 
adjusted  to  reduce  the  bandwidth  and  increase  the  damping  of 
the  given  predictor.  This  adjustment  procedure  requires  a 
degree  of  numerical  experimentation  to  obtain  a  suitable 
design.  Whereas,  the  markov  chain  approximations  need 
essentially  no  tuning  to  obtain  a  design  with  excellent 
observation  noise  immunity,  and  therefore  these  models  can 
be  used  "directly  off  the  shelf". 


REFERENCES 

(1)  S.  F.  Huling,  and  M.  Mintz,  "ENHANCED  FILTERING  & 
PREDICTION  FOR  AAA  FIRE  CONTROL:  An  Application  of  Game 
Theory  and  Time  Series  Analysis,"  Final  Report  on  Contract 
No.  DAAK10-78-C-0186 ,  Submitted  to  US  Army  ARRADCOM,  May 
1980.  Published  as  Contractor  Report  ARSCD-CR-80015 , 


104 


(2)  S.  F.  Huling,  and  M.  Mintz,  "ROBUST  AUTOREGRESSIVE 
PREDICTORS  FOR  AIRCRAFT  MANEUVERS:  A  Comparative  Study," 
Final  Report  on  Contract  No.  DAAA25-76-C-0435,  Submitted  to 
US  Army  Frankford  Arsenal,  October  1978. 

(3)  J.  D.  Kendrick,  "Estimation  of  Aircraft  Target  Motion 
Using  Pattern  Recognition,"  Ph.  D.  Dissertation,  Air  Force 
Institute  Of  Technology,  1978. 

(4)  J.  D.  Kendrick,  P.  S.  Maybeck,  and  J.  G.  Reid, 
"Estimation  of  Aircraft  Target  Motion  Using  Orientation 
Measurments , "  IEEE  Transactions  On  Aerospace  And  Electronic 
Systems ,  Vol.  AES- 17,  No.  2,  March  1981,  pp.  254-25^9. 

(5)  C.  A.  McNary,  et.  al.,  "Motion  Estimation  Using  Target 
Aspect  (META)  Feasibility  Study,"  Interim  Report  on  Contract 
No.  F33615-77-C-1227 ,  Submitted  to  US  Air  Force  Avionics 
Laboratory,  October  1979,  AFAL-TR-79-1077 . 

(6)  E.  B.  Pate,  and  M.  MintZ,  "NEW  TECHNIQUES  FOR  DESIGNING 
ROBUST  PREDICTORS  FOR  AAA  FIRE  CONTROL  SYSTEMS:  An 
Application  of  Markov  Chain  Models  and  Game  Theory,"  Final 
Report  on  Contract  No.  DAAK10-79-C-0232. 

(7)  T.  W.  Anderson,  and  L.  A.  Goodman,  "Statistical 
Inference  About  Markov  Chains,"  Annals  Of  Mathematical 
Statistics ,  Vol.  28,  1957,  pp.  89-110. 

(8)  E.  B.  Pate,  M.  MintZ,  W.  Dziwak,  and  S.  A.  Goodman,  "On 
Designing  Robust  Predictors  Based  On  Finite  State  Markov 
Chains:  A  Game  Theoretic  Approach,"  Proceedings  of  the 
Nineteenth  Annual  Aller ton  Conference ,  October  1981. 


105 


Next  page  is  blank. 


ROBUST  KALMAN  FILTERING 
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ABSTRACT 

This  is  a  preliminary  report  on  the  current  research  on  the  appli¬ 
cability  of  robust  Kalman  filtering  in  monopulse  radar  tracking 
systems.  (An  estimation  procedure  is  robust  if  small  perturbations 
in  the  noise  model  from  the  assumed  (Gaussian)  noise  model  result 
in  only  small  changes  in  the  mean-squared-error  of  estimate.) 


INTRODUCTION 

Optimal  guidance  laws  for  accelerating  targets  depend  on  adequate 
estimates  of  key  states  such  as  line-of-sight  rate  and  target 
normal  accelerations.  When  optimum  state  estimates  of  these  para¬ 
meters  are  available,  superior  missile  performance  can  be  demon¬ 
strated.  These  estimates  are  usually  provided  by  some  Kalman 
filter  algorithm.  In  the  classical  Kalman  filter,  noise  sources 
are  characterized  by  uncorrelated  Gaussian  statistics.  Unfortu¬ 
nately,  as  the  research  of  Masreliez  and  Martin  [1]  demonstrates, 
the  behavior  of  Kalman  filter  algorithms  can  be  severely  degraded 
when  the  actual  noise  disturbances  are  non-Gaussian,  particularly 
when  the  non-Gaussian  behavior  is  heavy  tailed.  The  latter  behav¬ 
ior  is  characterized  by  outliers.  In  monopulse  radars  this  heavy 
tailed  non-Gaussian  behavior  is  present  in  the  angle  tracking  sig¬ 
nals  because  of  target  glint.  A  monopulse  tracker  will  turn  until 
the  central  axis  of  the  receiver  antenna  is  aligned  with  the 
normal  of  the  incoming  (scattered)  wavefront.  For  a  point  target 
the  scattered  wavefront  is  spherical,  and  thus  the  gradient  is 
always  along  a  radial  vector  directed  from  the  receiver  aerial  to 
the  target.  A  complex  target  is  composed  of  a  number  of  spatially 
separated  scatterers  and  thus  the  spherical  wavelets  from  the 
individual  "point"  scatterers  will  interfere.  As  a  result  of  this 
interference  the  phase  of  the  received  signal  will  not,  in  general, 
be  independent  of  the  target  aspect.  The  component  of  the  phase 
gradient  vector  orthogonal  to  the  radial  direction  is  directly  re¬ 
lated  to  tracking  error.  It  is  this  type  of  error  that  constitutes 
Uiirit.  Since  the  phase  of  a  complex  target  varies  as  a  function 
of  target  aspect  and  motion,  the  statistics  for  the  Kalman  filter 
are  non-stationary .  Thus  the  Kalman  filter  must  adapt  to  the  non¬ 
stationary  glint  statistics.  Moreover,  the  pulse  repetition  rate 
of  the  radar  is  generally  much  higher  than  the  requisite  tracking 
rate.  For  this  reason  the  Kalman  filter  generally  processes  a  set 
of  statistics  based  on  a  fixed  number  of  sampled  radar  measure¬ 
ments.  If  these  statistics  are  the  mean  and  variance  of  the 
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angular  glint  based  on  a  sequence  of  radar  pulses,  then  they  will 
be  sensitive  to  intrinsic  signal  outliers  (i.e.,  large  glint 
spikes) .  For  this  reason  robust  statistical  alternatives  to  the 
mean  and  variance  are  considered.  Informally,  a  robust  estimate 
is  one  whose  performance  remains  quite  good  when  the  true  distri¬ 
bution  of  the  data  deviates  from  the  assumed  distribution.  The 
mean  and  variance  are  sensitive  to  large  changes  in  a  small  frac¬ 
tion  of  data  points.  Robust  estimates  are  more  resistant  to 
outliers.  The  purpose  of  this  research  is  to  clarify  the  role  of 
robust  estimates  in  angle  tracking. 

In  this  paper  the  basic  angle  tracking  loop  simulation  is  outlined 
and  some  preliminary  results  are  presented.  In  addition,  the  role 
of  robust  estimates  in  the  tracking  loop  are  identified  and  some 
comparisons  of  the  mean  and  variance  with  robust  estimates  are 
presented  using  some  simulated  glint  data. 


TRACKING  LOOP  MODEL 

The  basic  angle  tracking  loop  is  defined  in  Figure  1.  The  track¬ 
ing  loop  and  Kalman  filter  equations  are  derived  and  discussed  in 
the  paper  by  Pearson  and  Stear  [2] .  In  this  section  the  Kalman 
filter  equations,  the  controller  and  the  antenna  stabilization 
loop  denoted  by  G{S)  are  defined. 


TRACKiNG  RATE 


WLS  -  *-0S  RATE  €  -  ANGLE  POIK.ING  ERROR 

-  ANTENNA  RATE  ’  -  MEASURED  POINTING  ERROR 

<^AC  -  CGP^MANDEO 

ANTENNA  RATE 

FIGURE  1.  Pointing  Error  Control  System. 

For  this  simulation  a  planar  engagement  is  assimed  and  for  the  ini¬ 
tial  studies  the  radar  is  fixed  and  the  target  is  executing  a  con¬ 
stant  g  turn.  Also,  it  is  assumed  that  line  of  sight  and  antenna 
gimbal  coordinate  frames  are  almost  coincident.  Let  e  denote  point¬ 
ing  error,  which  is  the  integrated  difference  between  the  line-of- 
sight  rate  w  g  and  antenna  rate  Let  a^  and  a^  denote  the 
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target  and  interceptor  accelerations.  The  Kalman  filter  state 
equations  are 

^  =  “ls  ■  “a 

•  _  2R  .  1  .  ^  X 

‘^LS  R  “lS  R  ^  T 


with  observation  equation 
y  =  ke  +  r, 

where  k  is  the  angle  error  receiver  slope  and  r  and  q  denote  addi¬ 
tive  noises.  This  formulation  of  the  Kalman  filter  in  line-of- 
sight  coordinates  iS  appealing  because  the  filter  is  linear  and  the 
second  state  equation  reduces  to  the  formula  for  the  transverse  . 
acceleration  of  a  particle  moving  in  a  plane.  The  constants  R,  R 
and  T  are  respectively  range,  range  rate  and  target  acceleration 
time  constant.  The  estimates  of  R  and  R  are  provided  by  the  radar. 

The  estimates  e  and  Wto  produced  by  the  Kalman  filter  are  combined 

Liib 

linearly  to  produce  an  antenna  rate  command  of  the  form 

/s  ^ 

“aC  ""  ^i"lS  ^2^ 

where  is  usually  unity  and  is  proportional  to  the  tracking 
loop  rate. 

The  antenna  stabilization  loop  is  modeled  as  a  third  order  system 
whose  states  are  antenna  postion,  rate  and  acceleration,  which  are 
denoted  by  x^,  ^2  and  x^.  Let  C  denote  the  damping  ratio  and  u 

the  undamped  natural  frequency.  Finally,  let  u(t)  denote  a  contin¬ 
uous  real  valued  input  function.  The  state  equations  can  be  writ¬ 
ten  as 


•  2  2 

=  -2^a)X2  -  0)  X2  +  W  u(t) 

Xi  =  X2. 
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EXACT  SOLUTIONS  OF  THE  STATE  TRANSITION  MATRICES 


In  this  section  the  state  transition  matrices  for  the  Kalman 
filter  and  the  antenna  stabilization  loop  are  derived.  First, 
the  transition  matrix  for  the  Kalman  filter  is  derived.  Next, 
the  transition  matrix  that  is  used  to  represent  the  antenna  sta¬ 
bilization  loop  and  integrator  in  the  pointing  error  control 
system  using  Kalman  estimates  is  derived.  Finally,  the  transition 
matrix  is  used  to  obtain  the  sampled-data  time  domain  representa¬ 
tion  of  the  stabilization  loop  integrator  with  a  hold  circuit. 

Both  of  these  transition  matrices  are  derived  using  spectral 
operator  theory.  These  techniques  for  finite  dimensional  opera¬ 
tors  are  discussed  in  Lancaster  [3]  and  Zadeh  and  Desoer  [4]. 

Before  deriving  the  exact  solutions  of  these  systems,  the  concepts 
and  formulas  that  are  essential  in  the  development  of  spectral 
operator  theory  for  functions  of  matrices  are  presented.  Assume 
the  following  state  variable  formulation 


X  =  Ax  +  Bu 

(1) 

II 

o 

(2) 

where  A,  B  anc  C  are  respectively  n  x  n,  n  x  1  and  1  x  n  constant 
matrices  and  x,  y  and  u  are  real  valued  state  vectors  of  compati¬ 
ble  dimensions.  To  simplify  the  discussion  it  is  assumed  through¬ 
out  this  section  that  the  eigenvalues  of  A  are  distinct.  Let 

.  .  .  ,  denote  the  eigenvalues  of  A.  The  basic  strategy  of 

this  section  is  to  solve  the  linear  system  (1)  by  first  deriving 
an  exact  expression  for  exp (At)  and  then  solving  the  forced  sys¬ 
tem  by  integrating  the  general  solution.  The  spectral  operator 
theory  develops  an  exact  expression  for  elementary  functions  with 
matrix  arguments.  For  example,  the  theory  justifies  the  substi¬ 
tution  of  a  matrix  A  for  the  real  variable  in  the  power  series 
expansion  of  the  elementary  functions  such  as  exp(J2.)  and  sin(£). 
However,  computing  the  matrix  exp  (Ail)  by  the  infinite  power  ser¬ 
ies  can  be  difficult  except  in  very  special  cases  or  for  small 
values  of  t.  A  finite  power  series  expansion  can  be  obtained  by 
using  the  following  fundamental  formula  for  the  function  of  a 
matrix.  Given  any  real  valued  function  f,  which  is  analytic  at 
the  eigenvalues  of  A 

f(A)  =  ?  f(A,  )Z,  (3) 

k=l  ^  ^ 

where  •  •  •  '  distinct  eigenvalues  of  the  n  x  n 

matrix  A  and  the  matrices  have  constant  elements  and  depend 

exclusively  on  A.  The  n  x  n  matrices  Zj^  are  called  the  components 

of  A.  The  component  matrices  are  linearly  independent  nonzero 
matrices  that  commute  with  A  and  satisfy  the  following  identities 


no 


n 

(a)  E  Z,  =  1/ 
k=l  ^ 

(b)  =  Zj^  for  each  k 

(c)  Zj^Zj  =  0  for  k  7^  j 

Where  I  and  0  are  respectively  the  n  x  n  identity  and  null  matrix. 
Property  (a)  follows  from  the  fundamental  formula  (3)  by  letting 
f(X)  =  1.  Matrices  that  satisfy  properties  (b)  and  (c)  are  called 
respectively  idempotent  and  orthogonal.  Since  the  eigenvalues  of 
A  are  assumed  to  be  distinct,  the  can  be  computed  by  the 
formula 


n  n 

z,  =  n  (A  -  X.I)/  n  (X  -  X.) 

^  j=l  3  3 

jf^k  jT^k 

where  n  is  the  product  symbol.  An  alternative  method  for  com¬ 
puting  the  component  matrices  is  given  in  Lancaster  and  Zadeh. 
When  f (X)  =  eX  or  f(X)  =  X  there  results 

n 

exp  (At)  =  E  exp(X,t)Z, 
k=l  ^  ^ 


n 

A  =  E  X,  Z 


The  spectral  theory  of  an  operator  can  be  very  useful  in  computing 
some  fundamental  properties  of  exp (At).  By  property  (a)  for  t  =  0 


n 

exp  (At)  =  E  Z,  =  I, 
k=l  ^ 


which  is  one  of  the  basic  properties  of  the  state  transition 
matrix.  To  show  that  exp (At)  when  computed  by  the  fundamental 
formula  (3)  is  a  solution  of  (1)  with  zero-input  (i.e.,  the  input 
vector  u  is  zero  for  all  time)  first  form  the  product 


A  exp (At)  = 


n 

E  X,  Z, 

k=i 


n 


E  exp(X,t)Z, 
k=l  ^  ^ 


and  then  apply  the  idempotent  and  orthogonality  properties  of  the 
component  matrices  to  the  product  to  obtain 


n 


n 


E  Xk  exp(Xj^t)Zj^  =  E  exp(X^t)Z^ 


k=l 


k=l 
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which  proves  that  exp (At)  is  a  solution  of  (1)  with  zero-input 
response. 

These  techniques  are  now  used  to  solve  for  the  state  transition 
matrix  in  the  Kalman  filter.  The  3  x  3  A  matrix  is 


where  R,  R  and  t  are  respectively  range,  range  rate  and  the  tar¬ 
get  acceleration  time  constant.  Since  A  is  an  upper  triangular 
matrix  the  eigenvalues  of  A  are  the  diagonal  entries.  These  are 

O-D  1 

denoted  by  =  0,  ^2  "  "  ^  ^3  "  "  t*  Using  the  product  for¬ 

mula,  the  formal  expressions  for  the  correstponding  component 
matrices  become 

=  ^A^  -  (X2+X3)A  +  X2X3lj/X2X3 

Zj  =  ^a2-X3a|/A2(Xj-X3) 

Zj  =  /AjIAj-Aj), 

where  A^  denotes  the  product  of  a  matrix.  After  some  tedious 
computations  these  expressions  become 
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Using  the  fundamental  formula  the  exact  finite  series  expansion 
for  exp (At)  can  be  written  as 

exp  (At)  =  exp(0)Z^  +  exp^^ 

This  solution  clearly  exhibits  the  dependence  of  exp (At)  on  the 
eigenvalues  of  A;  thus,  the  transient  response  of  the  zero-input 
response  system  can  be  readily  analyzed. 

If  t<<l  so  that  exp(Xt)  can  be  approximated  by  the  first  two  terms 
in  its  series  expansion,  then  the  transition  matrix  becomes  the 
sum  of  two  matrices,  which  are 


This  expression  is  equivalent  to  integrating  the  linear  system 
with  zero-input  response  by  Euler's  method. 

Consider  the  3  x  3  A  matrix 


which  is  the  A  matrix  in  the  state  space  representation  of  the 
antenna  stabilization  loop  and  of  the  integrator  in  the  pointing 
error  block  diagram.  In  this  matrix  w  is  the  undamped  natural 
frequency  and  z,  is  the  damping  ratio.  The  eigenvalues  of  A  are 
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=  -?a)  + 


and  A  2 


=  where  lambda  bar  de¬ 


notes  complex  conjugate  and  j  =  Let 


0)  . 


denote 


the  damped  frequency  and  ct  =  cw  denote  the  reciprocal  of  the  time 
constant  of  the  antenna  stabilization  loop.  For  a  matrix  with 
one  real  eigenvalue  and  two  complex  conjugate  eigenvalues  the 
transition  matrix  exp (At)  can  be  decomposed  into  the  following 
useful  expression  that  is  given  in  Zadeh  and  Desoer  (page  611) . 


exp  (At)  = 
where  2Z  =  R  -  jx. 


exp(At)Zj^  + 


exp  (-at) 


^  R  cos (w^t) 


+  X  sin(w^ 


Again  the  spectral  decomposition  of  the  matrix  A  clearly  exhibits 
the  dependency  of  the  transition  matrix  on  a  and  the  damping  fre¬ 
quency  0)^.  By  utilizing  the  product  formula  for  the  component 
matrices  and  some  tedious  algebra  the  following  expressions  for 
the  component  matrices  can  be  obtained 


In  the  digital  simulation  of  the  error  tracking  loop  the  antenna 
response  and  the  integrator  are  computed  as  a  state  variable 
sampled  data  system.  The  general  solution  of  the  linear  system 
(1)  with  initial  conditions  given  at  t  =  0  and  x(0)  is 


7)4 


x(t)  =  exp(At)x(0)  +  f  exp(A(t-T) 

o  '• 

Suppose  the  input  is  sampled  with  period  T  and  between  sampling 
times  the  input  is  a  hold  circuit 

Bu(t)  =  Bu{NT)  for  NT  <  t  <  (N+1)T. 

Therefore  at  sample  time  t  =  NT 

x^(N+l)Tj  =  exp  (AT)  X  (NT)  +  h(T)u(NT) 

where 

T 

h(T)  =  /  exp(AJ.)Bd£. 

o 

The  only  task  left  in  computing  this  expression  is  the  evaluation 
of  h(T),  since  for  our  applications  exp (AT)  is  solved  exactly. 

If  A  is  nonsingular,  then  formally 

T  T  , 

/  exp(Ajl)Bd)l  =  f  (a"-^)A  exp(A5,)Bdil  =  A 
o  o 

This  expression  for  h(T)  is  a  direct  generalization  of  the  calcu¬ 
lus  for  one  dimensional  real  variables. 

In  our  application  one  of  the  eigenvalues  of  A  is  zero  and  so  the 
integral  must  be  evaluated  directly.  Since  the  constant  matrices 
commute  with  the  real  valued  exponential  and  trigonometric  func¬ 
tions  in  the  expansion  of  exp (AT),  the  integral  h(T)  is  evaluated* 
by  integrating  the  following  real  valued  functions 


j  Bu ( T ) d  . 


f  exp(-05/)  cos  (u)^A)dil  and 


/  exp(-ail)  sin  (w^X,)dil. 


Now  the  solution  of  the  first  integral  is 


-aT 

=  2.  2 
a  +(0, 


[( 


sin  (w^T)  -  a  cos  (w^T) 


)  ^ "] 


and  the  solution  of  the  second  integral  is 

--aT 


f2(a,a3d)  2,  2 

a  +(i3. 


|-a  sin  (w^T)  -  cos  (w^T)j  + 
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Thus,  the  exact  solution  of  h(T)  is 

h{T)  =  +  f^(a,a)^)R  +  f2(a,aj^)X- 

For  our  application  the  3x1  input  matrix  is 


where  K  is  a  constant  gain  matrix.  The  exact  value  of  K  depends 
on  the  requirements  of  the  sampled  data  system  in  the  error 
tracking  loop. 


EXPONENTIALLY  CORRELATED  TARGET  NOISE 

In  this  section  a  brief  discussion  of  first  order  exponentially 
correlated  noise  or  colored  noise  is  presented.  A  more  complete 
discussion  is  found  in  Jazwinski  [ 5] .  A  formal  mathematical  des¬ 
cription  of  such  a  process  is 

^  =  -aa  +  aato  0  <  t  (4) 

dt  = 

where  a  and  a  >  0  are  fixed  constants  and  o)  is  zero  mean,  white 
Gaussian  noise  with  correlation  function 

E  ^(jo  (t)  CO  (t)  j  =  5  (t  -  t)  . 

As  usual  6  (t)  is  the  Dirac  delta  function  and  E  is  the  expecta¬ 
tion  operator.  Formally,  the  solution  of  this  equation  is 

a(t)  =  e'^^aCO)  +  aa  /  e““  co  (S)  dS . 

o 

The  constant  a  is  the  variance  of  the  target  acceleration  and  a 
is  the  reciprocal  of  the  maneuver  (acceleration)  time  constant. 
Singer  [6]  proposed  the  exponentially  correlated  noise  process  as 
a  model  for  maneuvering  targets.  By  using  this  process  with  the 
appropriate  Kalman  filter,  he  proposed  a  tracking  algorithm  for 
piloted  threats.  In  his  paper.  Singer  proposes  the  following 
density  model  for  the  target  acceleration  variance.  For  complete¬ 
ness,  a  derivation  of  his  model  is  included. 

Let  A  denote  the  maximum  target  acceleration  rate.  Let  X  be 
max 

the  random  variable  that  denotes  the  target  acceleration  state. 
According  to  Singer, X  is  a  random  variable  with  the  following 
assigned  probabilities 
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P  =  ±  ""max)  =  W 


P(X  =  0) 


and  otherwise  the  target  accelerates  between  the  limits  and 

A  according  to  the  appropriate  uniform  distribution.  In  order 
max 

to  insure  that  X  is  a  bona  fide  random  variable  (i.e.,  P(-<»  <  X  <  “) 
=  1),  the  probability  of  the  mutually  exclusive  events 
<-Vx  <  X  <  0) ,  (0  <  X  <  must  be  1  -  +  P^) .  This 

implies  that  the  uniform  density  function  for  these  events  is 


f(X)  = 


*  ^0> 


The  variance  a  of  target  acceleration 


is  calculated  by  finding  the  variance  of  the  random  variable  X. 
The  expectation  of  X  is  zero  and  so  the  variance  of  X  is  given  by 


E(X^) 


1  -  ‘^'’max  ^  '’oM^A^ax  ^  „ 

2A  /  3  max  max 

max  / 


(1  +  4P, 


-  ^0>- 


This  completes  the  derivation  of  the  Singer  target  acceleration 
model . 

When  the  exponentially  correlated  noise  model  is  used  with  a 
sensor  having  a  constant  data  rate  with  sampling  period  T,  then 
(4)  can  be  converted  to  discrete  form.  By  sampling  the  solution 
of  (4) ,  starting  with  t  =  0,  the  recursive  form  becomes  at  the 
sampling  instances  jT 


^j+1 


=  Maj  +  Wj 


where 


aj  =  a{jT) 


exp (-aT) 


(j+l)T 

(jo.  =  aa  /  exp  (-a(j+l)T 

D  jT 


-  X)  w(X)dX, 


The  statistics  of  Wj  are  formally 


E(Wj)  =  0 
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E((OjW^)  =0  j  i 

E(a)j)  =  .5a(l  -  M^)a^. 

The  latter  equation  shows  how  the  sampled  noise  variance  is  related 
to  its  continuous  time  noise  variance.  Since  Wj  is  a  Gaussian 

process,  these  statistics  specify  its  density. 


POINTING  ERROR  CONTROL  SYSTEM  SIMULATION  EXAMPLE 

In  this  section  the  pointing  error  control  system  simulation  is 
briefly  discussed.  After  listing  the  initial ^conditions  and  para¬ 
meter  values,  some  preliminary  plots  of  the  simulation  output  are 
presented.  The  primary  purpose  of  these  plots  is  to  verify  the 
simulation  and  to  illustrate  the  effect  of  outliers.  The  requisite 
initial  conditions  and  parameters  to  implement  the  sampled  data 
Kalman  filter  algorithm,  sampled  data  antenna  stabilization  loop 
and  controller  gains  are  now  defined. 

Tracking  Period  At  =  .03  sec 

2 

Observation  Variance  R  =  .0001  (rad  ) 

2  4 

Target  Acceleration  Variance  Q  =  1.  (meters  /sec  ) 

Antenna  Damping  Factor  C  =  .4 

Antenna  Undamped  Natural  Frequency  w  =  100  (rad/sec) 

Controller  Gains  G^  =  1,  62=  l./At 

Target  Acceleration  Time  Constant  t  =  2. 

The  Kalman  filter  3x3  initial  covariance  matrix  was  initialized 
by  setting  all  diagonal  elements  equal  to  unity  and  the  off 
diagonal  elements  equal  to  zero.  For  this  example,  the  point 
target  executed  a  full  360  degree  constant  6  g  turn  with  target 
velocity  of  250  meters/sec.  The  outlier  noise  was  added  to 
angular  position  of  the  target  with  respect  to  the  radar  antenna. 
The  outlier  noise  model  was  generated  as  contaminated  distribution 
as  discussed  in  Tukey  [7] .  The  contaminated  model  was  generated 
as  the  mixture  of  two  normal  distributions,  each  with  zero 
mean  and  one  with  variance  equal  to  observation  variance  and  one 
with  ten  times  the  first  variance.  Let  us  denote  these  distribu¬ 
tions  as  N(0,R)  and  N(0,  lOR) ,  respectively.  The  mixture  of  the 
two  distributions  defined  as 

F(x)  =  pN(0,R)  +  (1  -  p)N(0,  lOR)  0  <  p  <  1 
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generates  the  contaminated  normal  distribution.  The  random  vari¬ 
able  X  with  this  distribution  can  be  generated  by  first  taking  a 
uniform  deviate  U;  if  U  <  p,  X  is  generated  by  independent  samples 
from  N(0,  R)  and  otherwise  choose  N(0,  lOR) . 


As  noted  by  Maybeck  [8]  and  Bierman  19] ,  the  usual  Kalman  filter 
algorithm  is  subject  to  numerical  difficulties.  To  overcome  these 
difficulties  the  UDU^  estimate-covariance  updating  algorithm  and 
the  modified  Gram-Schmidt  algorithm  for  the  time  update  algorithm 
of  Bierman  were  employed  in  this  simulation.  These  algorithms 
solve  the  Kalman  gain 


K  =  PH 


+  R 


the  error  covariance  matrix 


p  =  p  -  KH  p 

and  the  covariance  time  update 

~  ^  T  T 

P  =  $P$^  +  GQG 

where  the  subscripts  have  been  omitted  for  notational  simplicity. 

In  these  equations  is  the  Kalman  filter  state  transition  matrix, 
which  is  derived  in  the  previous  section,  H  is  the  observation 
row  matrix  (H  =  (1,0,0))  and  Q  is  the  process  noise  colvimn  matrix 
(G  =  (0,0,1)^),  P  is  the  covariance  matrix,  K  is  Kalman  gain  matrix 
and  Q  and  R  are  defined  in  the  parameter  list. 

A  few  selected  plots  from  the  simulation  are  illustrated  in 
Figures  2,  3,  4  and  5.  Figure  2  is  a  comparison  of  the  estimated 
target  position  with  actual  position.  Figure  3  is  a  comparison 
of  the  estimated  line  of  sight  rate  with  the  actual  line  of  sight 
rate.  Figure  4  is  a  plot  of  the  contaminated  normal  noise  added 
to  the  target  position.  The  outliers  are  clearly  evident  in  the 
plot.  Figure  5  is  a  plot  of  the  tracking  loop  innovations,  which 
is  the  difference  between  the  true  measurement  and  the  best  pre¬ 
diction  of  it  before  sampling.  Note  that  even  with  the  nonoptimum 
initial  estimates  and  sampling  period,  the  loop  maintains  track. 


119 


0.3  0.4  o.s  TR-'ZT  POSmC.  AND  NOISE 


ROBUSTIFYING  THE  KALMAN  FILTER 


A  statistical  procedure  is  resistant  110]  if  the  value  of  esti¬ 
mate  is  insensitive  to  a  small  change  in  all  of  the  data  values 
or  'to  large  changes  in  a  few  of  the  data  values.  According  to 
Huber  [11]  this  is  a  working  definition  that  can  be  used  to  define 
robustness.  Figure  6  is  a  plot  of  linear  glint,  which  clearly 
exhibits  the  presence  of  outliers  (glint  spikes)  that  are  intrin¬ 
sic  in  the  glint  signature.  Figure  6  is  a  sample  of  glint  gener¬ 
ated  for  a  complex  target  using  a  radar  target  model  developed  by 
Mumford  [12].  Mumford's  model  computes  the  backscatter  from  a 
complex  target  by  decomposing  the  complex  target  into  simple 
component  shapes  for  which  scattering  solutions  have  been  derived. 
This  modeling  technique  is  known  informally  as  "N-shape"  modeling. 
Angular  glint  is  defined  as  the  linear  displacement  of  the  center 
of  radar  reflection  from  the  defined  physical  center  of  the  target 
measured  along  a  line  in  the  plane  passing  through  the  target 
center,  at  right  angles  to  the  sightline,  to  the  physical  center. 
Its  magnitude  is  equal  to  the  tangent  of  the  angle  between  the 
true  and  apparent  directions  of  the  target  times  the  range. 

Linear  glint  is  simple  range  independent  angular  glint.  Thus, _ 
the  strategy  of  robustifying  the  Kalman  filter  is  based  on  making 
it  resistant  to  glint  spikes.  There  are  at  least  two  ways  to  im¬ 
plement  this  strategy.  The  first  way  is  to  preprocess  the  mono¬ 
pulse  radar  pulses  in  a  robust  manner  using  summary  statistics  as 
inputs  to  the  pointing  error  control  system  flowcharted  in  Figure 
1.  This  is  a  natural  approach  in  tracking  radars,  since  the  radar 
pulse  rate  is  much  higher  than  the  requisite  tracking  rate.  The 
second  way  is  based  on  the  robustified  Kalman  filter  developed  by 
Martin  [13] .  Both  of  these  techniques  are  outlined  in  this 
section. 

Generally,  for  a  complex  target,  the  resulting  glint  signature 
is  non-stationary  time  series.  Thus,  both  robust  techniques 
rely  on  adaptive  noise  estimates  of  the  input  noise  statistics. 

For  the  moment  we  ignore  the  interaction  of  target  maneuver  and 
the  glint  signature.  The  two  summary  statistics  of  the  input 
noise  that  are  required  are  an  estimate  of  location  and  scale. 

For  a  symmetric  distribution  estimate  of  location  is  the  center 
of  the  distribution,  while  an  estimate  of  scale  is  the  spread  of 
the  distribution.  For  a  Gaussian  distribution  the  optimijm  esti¬ 
mate  of  location  is  the  mean  and  optimum  estimate  of  scale  is  the 
variance  (standard  deviation) .  For  the  moment  in  this  discussion, 
the  correlation  structure  in  the  glint  signature  is  also  ignored. 
As  amply  demonstrated  by  Tukey  [7]  for  non-Gaussian  heavy  tailed 
symmetric  distributions  these  classical  estimates  are  unsafe. 

Following  Martin  [14]  the  following  definitions  are  introduced 
for  completeness.  Let  Y^,  .  .  .  ,  Y^  denote  a  univariate  data 

sample.  A  statistic  is  simple  a  function  of  the  data 

T(Y)  =  T(Y^,  .  .  .  ,  Y^) .  An  estimator  is  a  statistic  whose  value 

is  supposed  to  provide  an  indication  of  a  parameter  in  a  para¬ 
metric  statistical  model  for  the  data.  A  real  scalar-valued  esti- 
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FIGURE  6.  Location  Comparison  (N-Shape  Glint). 


mate  is  translation  invariant  if  for  any  real  number  C  and  the 
constant  n-dimensional  unit  vector  1 

-*■ 

T(Y  +  Cl)  =  T(Y)  ; 
translation  equivariant  if 

T(Y  +  Cl)  =  TCY)  +  C; 

scale  invariant  if  for  any  real  constant  C  >  0 
T  (CY)  =  T  (Y)  ; 


and  scale  equivariant  if 
T(CY)  =  CT(Y)  . 
The  sample  mean 


T(Y) 


Y. 


1 


is  translation  and  scale  equivariant.  The  sample  standard 
deviation 


S(Y) 


55^  (Y.  -  t(y))2 

is  translation  invariant  and  scale  equi variant.  The  robust  esti¬ 
mate  of  scale  CMADM  =  Median  Absolute  Deviation  from  the  Median 
=  median  -  median  (Y) |/.6745  is  translation  invariant  and 

scale  equivariant.  The  divisor  .6745  makes  CMADM  a  consistent 
estimate  of  the  standard  deviation  if  the  sample  is  drawn  from  a 
normal  population.  A  robust  estimate  of  location  can  be  achieved 
by  Winsorizing  (see  Huber,  p.  18)  the  data.  To  metrically  Winsor- 
ize  the  data,  the  observations  Y^,  .  .  .  »  Y^  are  replaced  by 

pseudo-observations.  In  this  study  the  pseudo-observations  were 
obtained  by  setting  all  observations  beyond  3S(Y)  equal  to  S{Y) 
and  then  recomputing  the  sample  mean.  This  procedure  is  transla¬ 
tion  invariant  and  scale  equivariant.  Another  robust-estimate  is 
Tukey's  biweight  [7,  p.  353],  which  is  defined  as 


T(Y) 


E  W.Y. 
1  1 


where 


<  1 


otherwise 


and  S  =  CMADM.  The  constant  C  is  callled  the  cutoff  parameter, 
in  this  study  C  =  6.  Biweight  is  translation  and  scale  equivari¬ 
ant.  Associated  with  each  of  these  estimates  is  an  influence 
function,  which  in  exploratory  data  analysis  governs  the  effect 
of  the  value  of  one  data  point  on  the  estimate.  Some  influence 
functions  are  graphed  in  Figure  7.  The  linear  influence  curve  in 
Figure  7(i)  is  the  influence  curve  for  the  sample  mean  and  shows 
that  the  sample  mean  is  directly  affected  by  a  change  in  one  data 
point.  The  influence  curve  labeled  (ii)  is  Huber's  monotone  func¬ 
tion  and  the  influence  curve  (iii)  is  Hampel's  two-part  redescend¬ 
ing  function.  The  role  of  these  influence  functions  in  robust 
statistics  is  discussed  in  Huber  [11] .  Both  of  the  latter  influ¬ 
ence  functions  ignore  changes  in  measurement  outside  of  a  band 
and  respond  within  the  band;  thus,  outliers  outside  the  acceptance 
band  are  rejected. 


Martin's  robustified  Kalman  filter  modifies  the  state  correction 
equation  and  the  conditional  error  covariance  matrix  equation 
with  a  suitable  scaled  influence  function.  Two  excellent  candi¬ 
dates  for  an  influence  function  in  the  Martin  filter  are  the  Huber 
and  Hampel  functions  in  Figure  7.  The  details  of  his  filter  are 
beyond  the  scope  of  this  paper  and  the  interested  reader  should 
consult  Martin  [13] . 
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Figures  6  and  8  illustrate  the  different  estimates  of  location 
and  scale  as  provided  by  the  different  statistics.  The  data  in 
Figure  6  represents  the  glint  signature  of  a  complex  target 
sampled  every  .001  degree  of  target  aspect  in  a  10  degree  sector. 
Each  of  the  statistics  in  Figures  6  and  8  are  derived  from  suc¬ 
cessive  non-overlapping  .02  degree  intervals  containing  20  sam¬ 
ples.  In  Figure  6  the  points  labeled  by  octagons  are  sample 
means,  the  triangular  labels  are  the  Winsorized  estimates,  the  X 
labels  are  sample  medians  and  the  pluses  are  center  or  Tukey's 
biweight. 

The  resistence  of  median  and  biweight  are  clearly  indicated  in  the 
44  to  45  degree  interval.  In  Figure  8,  square  labels  represent 
standard  deviation,  the  octagons  represent  the  Winsorized  esti¬ 
mates,  and  the  triangles  represent  CMADM.  Again,  the  resistance 
of  CMADM  versus  the  standard  deviation  is  clearly  illustrated. 


CONCLUSION 

Preliminary  results  indicate  that  the  classical  Kalman  filter  will 
be  a  suboptimal  design  in  the  presence  of  heavy  tailed  non- 
Gaussian  distributions.  The  theory  of  robust  statistics  and 
robust  Kalman  filters  offers  an  important  set  of  tools  that  can  be 
used  to  optimize  the  design.  Future  research  will  be  directed 
towards  this  goal. 
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ABSTRACT 

This  paper  describes,  by  means  of  a  missile  autopilot  design  case 
study,  a  technique  currently  in  use  for  determining  the  stability 
and  dynamic  characteristics  of  a  digital  control  system  in  terms 
of  several  selected  system  parameters.  The  method  requires  that 
the  system  characteristic  equation  be  available  in  the  complex  z- 
domain.  It  is  the  ability  to  handle  more  than  one  free  system 
parameter,  which  need  not  be  (but  often  is)  a  controller  gain 
which  makes  this  method  more  powerful  than  most  design  techniques . 
The  paper  also  demonstrates  the  generation  of  the  system  equations 
by  the  Systematic  Analysis  Method.  This  method,  an  alternative 
to,  for  instance  signal  graph  methods,  is  applicable  to  both  sim¬ 
ple  and  complex  systems.  The  final  design  was  evaluated  by  the 
method  of  Digital  Control  System  Response  by  Cross-Multiplication. 


INTRODUCTION 

This  paper  is  expository  in  nature  concerning  the  design  of  digi¬ 
tal  control  systems.  The  case  study  presented  in  this  paper  is 
that  of  the  design  of  a  missile  digital  autopilot.  The  methods 
used  in  the  design  are  those  currently  in  use  to  produce  such  de¬ 
signs  which  are  coming  about  with  ever  greater  frequency  since  the 
advent  of  the  ubiquitous  microprocessor.  The  paper  traces  through 
the  formulation  of  the  equations  of  motion,  the  rationale  for 
autopilot  transfer  function  selection,  the  application  of  the 
Parameter  Plane  Method  of  autopilot  parameter  selection  and  ends 
with  a  simulation  to  evaluate  the  missile  response,  at  the  sampl¬ 
ing  instants,  to  a  test  input.  References  are  included  so  that 
readers  may  pursue  in  greater  depth  if  they  desire  the  details  of 
the  various  procedures  used. 


PROBLEM  FORMULATION  AND  AUTOPILOT  DESIGN 

The  case  study  selected  for  this  paper  is  that  of  an  autopilot 
design  of  a  tactical  US  Army  missile.  To  formulate  the  study ^ 
mathematically,  the  equations  of  motion  of  a  missile  were  derived 
assuming  planar  motion.  Thus  two  differential  equations  are  re¬ 
quired,  one  describing  translation  of  the  missile's  center  of  mass 
in  the  plane,  the  other  describing  rotation  of  the  missile  air¬ 
frame  about  the  center  of  mass.  With  reference  to  Figure'  1  the 
following  quantities  are  defined: 

F  =4  engine  thrust  force 

A  A  aerodynamic  axial  force 

N  A  aerodynamic  normal  force  (note:  N  defined  a 

^  positive  quantity  in  the  negative  3b  direction) 
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M  A  aerodynamic  pitching  moment  about  the  center  of  mass 
Jlo  ^  distance  from  vehicle  center  of  mass  to  moment 
~  reference  point 

I  _A  polar  moment  of  inertia  of  missile  about  an  axis 
~  normal  to  the  plane  and  directed  through  the 
center  of  mass 

m  A  the  mass  of  the  missile 
g  A  the  acceleration  due  to  gravity 

In  ~2n  3n  A  designate  unit  vectors  in  the  n-coord- 
’  '  inate  frame  according  to  the  usual 

right  hand  rule  i.e. ,  In  x  2h  = 

3fl 

Rm  A  the  vector  relating  the  origin  of  the  r-coord- 
“  inate  system  to  the  origin  of  the  b-coordinate 
system  (thus  relating  the  missile's  center  of 
mass  in  the  b-coordinates  to  the  assumed  iner¬ 
tial  space  coordinates  i.e.  the  r- coordinate 
system) 

6  A  the  angle  between  the  missile's  longitudinal 
~  (roll)  axis  and  the  local  horizontal  (thus  an 
inertial  coordinate  under  the  non-rotating 
earth  assumption) 

V  A  the  missile's  velocity  vector^ 
a  A  the  angle  between  V  and  the  missiles  longitu- 
~  dinal  axis 

6'  A  aerodynamic  fin  deflection 


Figure  1 
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The  equations  of  motion  are  found  by  applying  D'Alembert's  Princi¬ 
ple  to  obtain  the  translational  and  rotational  differential  equa¬ 
tions 


m  to  =  (F-A)  lb  -  N  3b  +  mg  3r  (1) 

10  =  M  -  JloN  (2) 

The  vector  notation  has  been  omitted  from  equation  (2)  because  of 
the  assumed  one  degree  of  rotational  freedom. 

Typically  there  is  only  weak  interaction  or  coupling  between  the 
translational  equation  (1)  and  the  rotational  equation  (2),  Thus 
this  interaction  will  be,  for  the  purposes  of  this  paper,  ignored. 
It  is  further  assijmed  that  the  missile  is  in  nominally  level  flight 
Therefore  the  variable  0,  usually  denoted  by 

0  =  00  +  0(t)  (3) 


becomes  merely  9(t).  Linearizing  the  aerodynamics  leads  then  to 
a  transfer  function  between  vehicle  pitch  rate  and  fin  deflection 
of  the  form  ^ 


0(s)  b(s  +  d) 

6''(s)  ~  s^  +  2ca)nS  + 

For  a  particular  missile  in  level  flight  this  transfer  function 
becomes 

9(s)  _  (-216.6)  (s  +  0.693) 

6'  ~  7s ^  +  1.963  +  60.196)  *  (5) 

With  negligible  loss  in  modeling  fidelity  this  may  be  approxi¬ 
mated  to 


6(s)  (-216.6)  (s) 

6'  "  (s^  +  60.196) 


This  then  is  the  pitch  plane  rotational  transfer  function  with 
which  the  autopilot  is  to  work  such  that  the  actual  missile  angu¬ 
lar  rate,  0,  controlled  by  6 '(the  fin  angle)  corresponds  to  a  com¬ 
manded  angular  rate  0c,  furnished  by  the  missile  guidance  system. 
For  the  sake  of  simplicity  in  early  analyses  it  is  assumed  that 
the  vehicle's  angular  rate,  0,  is  sensed  perfectly  and  that  the 
fin  deflection,  5',  follows  perfectly  its  input  command.  If  it  is 
postulated  that  the  missile  rate,  0,  should  follow  a  step  command 
of  rate,  0c,  with  negligible  error  then  the  use  of  some  form  of 
integration  process  between  the  system  error,  0c- §,  and  the  fin 
deflection,  6",  is  suggested.  Recognizing  that  the  damping  of 
rotational  motion  of  the  missile  is  virtually  non-existent  and, 
indeed,  is  modeled  as  zero,  suggests  the  use  of  some  form  of  mis¬ 
sile  airframe  rate  feedback  to  fin  deflection,  6',  to  stabilize 
the  airframe  rotation.  The  most  efficacious  autopilot  algorithms 
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for  final,  eventual  flight  use  are  the  subject  of  further  study. 
However,  the  methodology  expostulated  above  allows  one  to  arrive 
quickly  at  workable  first  designs  which  could  be  evaluated  in 
three  or  six  degree  of  freedom'  simulations  and  tuned  up  as  the 
results  of  such  simulations,  as  well  as  more  information,  v.g. 
rate  sensor  and  actuator  models,  becomes  available.  Based  on 
these  considerations  one  possible  block  diagram  for  the  autopilot 


Figure  2 


Starring  denotes  a  sampled  quantity  and 

Di  denotes  a  digital  compensator  transfer  function 
Gho  denotes  the  transfer  function  of  a  zero  order  or 
"box  car"  hold 

Gr)  denotes  the  transfer  function  between  missile 
^  angular  rate,  9(s),  and  the  fin  deflection,  6  (s). 
T  denotes  sampling  period  of  the  s3mchronous 
samplers 


A  number  of  methods^  are  available  with  which  to  develop  the  trans¬ 
fer  function  between  6  and  Gc,  which  is  necessary  to  study  stabil¬ 
ity  and  performance.  The  method  used  here,  called  the  Systematic 
Analysis  Method  or  SAM,  is  a  general,  easily  applied  technique 
applicable  to  both  simple  and  complex  systems.  First  one  selects 
the  variables  at  the  inputs  to  the  samplers  as  the  unknowns.  Then 
the  original  equations  are  written  and  tabulated  in  Table  1.  it 
any  of  these  equations  contain  the  product  of  an  unsampled  system 
variable  and  an  unsampled  transfer  function  they  are  modified  by 
substitution  to  eliminate  unstarred  (or  unsampled)  variables  and 
tabulated  in  Table  1.  Finally,  a  column  in  the  table  is  con¬ 
structed  in  which  the  equations  from  the  first  or  second  steps, as 
appropriate,  are  transformed  into  the  sampled  domain  according  to 
the  relationships 


(RG)*  =  RG* 

(RG*)  =  R*G* 

(R  )*  =  R* 

Application  to  the  problem  at  hand  yields  for  the  first  column 

Original  Equations 


e(s)  =  9c(s)  -  G(s) 

6'(s)  =  D6(s)C2  £(s)  -  D6(s)C3  {Dd(s)  9*(s) 
9(s)  =  Gp(s)Giio(s)6"*(s) 
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and  for  the  second  coltimn 

Pulsed  Equations 

e*(s)  =  0j(s)  -  e*(s) 

=  Dt(s)C2e*(s)  -  Df(s)C3D3(s)e*(s) 
e*(s)  =  Gp(s)Gt^^(s)*  6'*(s) 

Table  1 

Lastly  the  desired  input  output  relationship  is  foxand  by  substi¬ 
tution  to  be 

,  *  (7) 

0*(s)  _  CaGpGho^ (s)®6 (s) 

(i+C2Dj(s)'  GhoGp^Cs)  +C3  D§(s)D;5(s)GhoGp^(s)) 


or,  expressed  in  the  z- domain, 

§  (2)  ^  _ C2  g;^(z)  D6(z) _ 

§c(z)  (l+CaDgCz)  GhoGp(z)  +  C3D5(z)  Dd(z)GhoGp^z)) 

Based  upon  the  design  goals  and  approaches  expressed  above  D5 
should  perform  an  integration  operation  whereas  Dd  should  be  se¬ 
lected  such  that  missile  rate  propagates  to  fin  deflection.  These 
considerations  suggest  lettine 


D6(z)Dd(z)  =  1  (9) 

and  D6(z)  =-31z-^  ^0) 

(z-1) 

where  it  is  recognized  that  equation  ten  corresponds  to  the  inte¬ 
gration  operation  by  means  of  a  rectangular  rule.  The  plant 
transfer  function,  Gp(s),  has  already  been  tabulated  as  equation 
(6).  The  first  order  hold  transfer  function,  Gho(s),  is  known 
to  be 

Ts 

Gho(s)  =  (11) 


Thus 


^  _  b  s 

Gp(s)Gho(s)  -  (s^+iond  - i 


,  /I  -TSs 

b(l-£  ) 

(  s  ^+a)n  ^  ) 


This  becomes,  in  the  z-domain. 


(12) 


GhoGp(z)  =  _  b(slm  coni')  (z-1)  ^  (13) 

(ji)n(z^-2  (cos  wnT)z+  1) 

Substituting  equations  (9)  ,  (10)  ,  and  (13)  into  (8)  yields  the  de¬ 
sired  overall  transfer  function 
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(14) 


e  (z)  _  (Ic^b  sin  a)nl)z 
e^Cz)  L(z) 

where  L(z)  is  the  polynominal  wn  z^  +  (1bC2sin  wnT  -  2a)n  cos  wn’^  + 

C  b  sin  a)nT)z  +  (ton  -  bCs  sin  tonl)  and  where  C2  and  C3  are  con- 
stants  yet  to  be  selected.  The  equation  will  be  examined  first  to 
see  how  the  missile  responds  to  a  unit  step  input.  Applying  a 
unit  step  of  command  and  using  the  final  value  theorem,  (assuming  a 

final  value  exists)  produces 

lim  e(nT)  =  lim  z  (z-1)  6(^ 

nT->oo  z^lCi^)^~  0cW  ^  ^ 


which  when  evaluated  yields 


V  _  T  C2b  sin  ton  T  _ 

“ '  '  T  C2b  sin  lonT  +  2  (on(l-cos  tonT) 


(16) 


To  continue,  T  was  selected  to  be  0.10  second  on  reasonableness 
grounds,  and  b  and  ton  are  missile  parameter  previously  stated 
viz.  -216.6  and  7.76  sec~\  Substituting  yields  an  expression  for 
the  limit  in  terms  of  the  free  parameter  C2  as  follows 


e(oo) 


C2 

C2  -  0.276 


(17) 


Defining  the  error  as  the  amount  by  which  the  output  fails  to 
attain  a  value  of  unity  allows  the  construction  of  Table  2 


7o  error  C2 


21.7  -1 

5.24  -5 

2.7  -10 

0.5  -50 


Table  2 

Thus  to  keep  the  error  indicated  by  this  particular  measure  of 
performance  below  a  few  percent  a  value  of  C2  less  than  -10  is^ 
indicated  (note  that,  as  will  be  shown  below,  C2  must  be  negative 
to  insure  a  stable  system) . 

What  remains  to  be  done  in  this  quick  look  design  procedure  is  to 
select  the  free  parameters,  C2  and  C3.  This  is  done  in  such  a 
way  as  to  bound  the  static  error  as  indicated  in  Table  2  and  to 
meet  stability  requirements.  Noting  that  there  are  two  free  para¬ 
meters  to  select  it  is  necessary  to  decide  upon  a  particular  tech¬ 
nique.  By  freezing  C3,  one  might,  for  example,  apply  root  locus 
or  frequency  response  techniques  in  the  z  and  w^  domains  respec- ^ 
tively.  However,  there  is  no  need  to  specify  either  parameter  in¬ 
dependently  of  the  other  if  one  utilizes  the  Parameter  Space 
Method®  of  design. 
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The  method  is  based  upon  analysis  and  synthesis  methods  described 
in  Siljak's  monograph  **  as  extended.  Briefly  put,  the  method 
allows  one  to  map  the  location  of  the  roots  of  the  system's  char¬ 
acteristic  equation  into  a  plane  whose  coordinates  are  the  system's 
free  parameters  and  which  is  readily  divided  into  regions  identi¬ 
fied  with  system  stability  and  instability.  It  is  noted  that  the 
free  parameters  need  not  be  gains  but  could  just  as  well  be  some 
other  system  parameter.  In  addition  to  the  characteristic  root 
locations,  such  things  as  contours  of  constant  relative  damping 
factors  and  specified  exponential  time  constants  may  be  transform¬ 
ed  as  contours  into  the  parameter  plane.  Thus  in  a  manner  somewhat 
reminiscent  of  the  classical  root  locus  a  portrait  may  be  presented 
of  all  pertinant  aspects  of  the  system's  transient  response  with, 
however,  the  cogent  difference  that  they  may  be  presented  as  func¬ 
tions  of  several  parameters  rather  than  the  simple  parameter  open 
loop  gain.  The  method  will  be  exemplified  by  application  to  the 
problem  at  hand  viz.  that  described  by  equation  (14). 

As  detailed  in  reference  three,  this  design  technique  deals  direct¬ 
ly  with  the  system's  characteristic  equation.  As  shown  in  the 
reference,  the  coefficients  of  the  characteristic  equation  are 
cast  if  possible  into  the  linear  form 

Z  (d^Ko  +  fA  Ki+  g4)zJ  (18) 

for  systematic  generation  of  the  computer  program  input  data.  If 
this  linear  combination  is  not  possible,  the  method  can  still  be 
used  but  becomes  more  tedious.  In  this  form  the  Ko ,  Ki  variables 
are  the  parameters  to  be  selected  i.e.  they  are  the  coordinates  of 
the  parameter  plane.  It  is  convenient  to  fill  out  a  table  where 
coltamns  are  d j ,  f  j  ,  g^ .  The  transpose  of  these  columns  provide 
the  input  to  the  computer  program  used.  Noting  that  the  denomin¬ 
ator  of  (14)  corresponds  to  this  problem's  characteristic  equation, 
allows  the  construction  of  Table  3  below. 

_j - ^ - dj - 1 - _lj -  -  - Sj - 

0  -b  sin  a)nT=  +2.1632  0  «  1.0 

1  b  sin  wnT  =  -2.16 32  ^  sin anT=- 2. 1632x10"  -2  cos wnT  =-1.994 

2  0  ton  0  1.0 

Table  3 

In  the  particular  computer  program  used  the  transpose  of  the  dj 
column  becomes  the  A  matrix,  the  transpose  of  the  fj  colixmn  the  B 
matrix  and  the  transpose  of  the  gj  column  the  F  matrix.  The  pro¬ 
gram  was  run  with  the  result  exhibited  as  Figure  3. 

By  applying  the  appropriate  "shading  rules"  (or  factoring  the  CE 
for  a  nianber  of  test  points)  it  can  be  established  that  the  inte¬ 
rior  of  the  triangle  in  the  C2,  C 3  plane  corresponds  to  system  sta¬ 
bility  i.e.  a  condition  wherein  all  roots  of  the  CE  lie  within  the 
unit  circle  in  the  z-plane.  Also  plotted  are  contours  of  constant 
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relative  damping  factor,  zeta,  for  two  values  i.e.  0.5  and  0.707. 


C3 


Figure  3 


Although  no  system  parameter  sensitivity  studies  were  performed  on 
this  example  it  seems  prudent  to  stay  some  distance  from  the  sta¬ 
bility  boundaries  at  this  point  in  the  design.  Thus  a  set  of  val¬ 
ues  for  Cz  and  C3  of  Cz=  -50  and  €3=  -0.4  was  chosen.  One  notes 
this  point  lies  between  the  0.5  and  0.707  zeta  contours  in  the  C2- 
C3  plane  and  at  some  distance  from  the  various  stability  boundar¬ 
ies.  In  addition,  the  choice  of  C2=  50  will  insure  a  small  error 
in  response  to  a  unit  step,  as  previously  discussed. 


SIMULATION  RESULTS 

An  evaluation  of  the  response  of  the  system  at  the  sampling  in¬ 
stants  is  presented  as  Figure  4.  It  was  evaluated  by  means  of  a 
computeiT  program  implementing  the  "Determination  of  Digital  Control 
System  Response  by  Cross-Multiplication"  as  presented  by  Seltzer  in 
reference  five.  The  input  to  this  program  consists  merely  of  the 
coefficients  of  the  nxamerator  and  denominator  polynomials  of  the 
overall  transfer  function  (equation  14)  and  for  plotting  purposes 
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the  sampling  interval,  T. 
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Figure  4 


From  the  Figure  and  the  associated  printout  it  is  seen  that  the 
maximum  percent  overshoot  occuring  at  the  sampling  instants  is 


7o  o .  s . 


1,126-1.000 

1.00 


X  100  -  12.6% 


which  occurs  at  the  third  (nT  =  0.02  sec)  sampling  instant.  This 
compares  favorably  with  Franklin  and  Powell's  ®  approximation  re¬ 
lating  percent  overshoot  to  relative  damping  factor  for  a  second 
order  system  with  no  finite  zeros 

which  is 

C  <  (0.6)(1  - 

here  then 

c  <  (0.6)  (1  -  =  0.52 


The  degree  of  correlation  can  be  judged  when  it  is  recalled  that 
the  C2,  C3  values  were  chosen  such  that  their  set  of  values  lay 
between  the  c  -  0.5  and  z  =  0.707  loci. 
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CONCLUSION 


Presented  in  this  paper,  by  means  of  the  case  study  of  a  digital 
autopilot  design,  is  a  modern  method  of  designing  digital  control 
systems  without  regard  to  continuous  system  considerations.  The 
Parameter  Space  Method  of  generating  free  system  parameter  values 
is  employed  to  ensure  stability  in  both  the  absolute  and  relative 
sense.  Sufficient  detail  has  been  included  such  that,  given  the 
availability  of  Parameter  Space  and  Response  Cross-Multiplication 
computer  programs,  one  may  rapidly  and  methodically  design  digital 
control  systems  and  keep  them  updated  as  plant  characteristics 
change.  This  capability  is  especially  useful  during  the  initial 
design  phases  of  a  system.  The  methods  presented  are  currently 
being  used  in  the  design  of  US  Army  missile  autopilots ,  space  vehicles 
control  systems  and  have  been  used  to  design  aircraft  control  sys¬ 
tems.  It  is  of  course  understood  that  when  final  configurations 
become  available  the  total  system  design  would  be  validated  by 
appropriate  simulation.  Such  simulation  would  include  time  vary¬ 
ing,  non-linear  and  certainly  more  detailed  descriptions  of  the 
system  and  its  components  v.g.  the  missile  fin  actuation  sub¬ 
system.  In  fact  such  additional  simulations  have  been  performed 
in  various  instances  and  confirmed  the  efficacy  of  the  design 
method.  Additional  work  needs  to  be  done  in  such  areas  as  map¬ 
ping  points  on  the  real  axis  of  the  z-plane  into  the  parameter 
plane  in  order  to  control  further  the  system  response  time. 
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GENERAL  INTRODUCTION 


INTRODUCTION 

A  new  guidance  law  is  derived  for  tactical  missiles  that  undergo  significant  acceleration  and  drag.  Past 
guidance  laws  based  on  constant  velocity  missile  dynamic  engagement  models  result  in  sub-optimal 
trajectories  when  applied  to  missiles  used  most  often  in  tactical  engagements.  Recall  that  in  most 
tactical  engagements  the  missile  accelerates  (booster  burn,  or  "boost  to  mach")  and  coasts  to 
intercept.  The  new  guidance  law  is  based  on  a  more  complete  dynamic  engagement  model  that 
contains  the  missile  expected  axial  accelerations  and  the  control  direction  constraint  normal  to  the 
missile  velocity  vector.  The  non-linear  two  point  boundary  value  problem  [1]  is  solved  by  separating  the 
missile  axial  velocity  dynamics  from  the  control  dynamics  to  obtain  a  time  varying  linear  dynamic 
constraint  model.  The  results  are  verified  by  simulation,  and  indicated  significant  performance 
improvement  in  a  crossing  target  surface  to  air  engagement. 

The  history  of  short  range  missile  guidance  is  very  rich,  for  an  excellent  summary  the  reader  is  referred 
to  an  article  by  Pastrick  et.  al.  [2].  We  assume,  as  before,  that  a  well  designed  missile  autopilot  is 
implemented;  so  that  for  guidance  law  derivation  we  can  assume  a  simple  relationship  between 
commanded  accelerations  and  missile  response  [1 -6,8,9].  We  also  assume  that  the  intercept  time  is 
know  or  approximately  known  by  estimation,  [2,3,9].  Under  benign  conditions  any  monotonic  function  of 
predicted  miss  can  serve  as  the  guidance  law.  However  the  more  optimal  use  of  hardware  resources  is 
made  possible  by  implementation  of  more  nearly  optimal  guidance  laws  which  will  translate  into  more 
robust  guidance,  or  wider  engagement  envelope.  The  optimal  guidance  law  derived  here  can  be 
reformulated  as  a  biased  PNG  guidance  law.  Hence  the  bias  values  can  be  computed  at  a  reduced 
rate  for  computer  implementation  economy.  The  traditional  approximations  are;  Two-dimensional 
motion.  Point-mass,  Instantaneous  Control  response,  linear  dynamic  model,  and  constant  speed  [9].  Non 
linearities  are  traditionally  ignored  to  facilitate  the  use  of  linear  optimization  methods  to  obtain  the 
desired  feedback  control  law.  In  this  note  we  retain  the  Point-mass  approximation,  and  use  a  slightly 
more  accurate  dynamic  model,  and  incidentally  provide  a  new  explanation  of  three  dimensional  PNG. 

PAST  RESULTS 

The  Proportional  Navigation  Guidance  (PNG)  was  shown  to  be  optimal  by  Bryson  [1]  if  neither  missile  or 
target  are  accelerating.  Three  dimensional  guidance  including  the  effects  of  accelerating  target  and 
new  methods  of  estimating  the  time  of  intercept  have  been  presented  [4,3].  Recall  that  ’optimal’  has 
only  meaning  relative  to  the  cost  function  minimized  and  the  dynamic  constraint  model  used.  The 
relative  completeness  of  the  dynamic  constraint  model  is  the  issue  of  this  note.  Expected  missile 
acceleration  have  in  the  past  been  included  in  the  time-to-ao  to  intercept  calculation,  but  not  included 
in  the  dynamic  constraint  model  used  in  the  optimal  guidance  law  calculation.  Because  of  past  use  of 
inconsistent  dynamic  constraint  models  [1,3-5],  the  resulting  guidance  laws  generated  unusable  control 
commands  along  the  missile  velocity  axis. 

in  this  paper  we  also  show  that  the  same  constant  missile  and  target  dynamic  models  are  used  in 
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deriving  both  PNG  [1]  and  the  ’optimal’  three  dimensional  control  law  of  Riggs  [3].  And  hence  the 
improved  guidance  performance  reported  in  [3]  is  the  result  of  using  better  time-to-intercept 
calculations  than  is  implicit  in  PNG. 

The  problem  of  target  tracking  in  the  presence  of  noise  is  not  considered  here.  However  the  improved 
guidance  law  presented  here  leaves  more  control  authority  for  countering  unexpected  target 
maneuvers.  Also  the  target  is  assumed  to  be  non  accelerating,  control  modifications  to  account  for 
random  target  accelerations  might  be  included  as  in  [4].  We  next  summarize  the  result  of  Reference 
[3],  for  use  as  the  starting  point  for  the  developments  of  this  note. 

A  LINEAR  OPTIMAL  GUIDANCE  PROBLEM  SOLUTION 

Consider  the  dynamic  model.  Equation  (1),  of  the  missile  engagement  relative  to  the  target,  with  no 
target  acceleration,  as  used  in  References  [3-5].  The  optimal  guidance  problem  (or  the  two  point 
boundary  value  problem)  is  to  minimize  the  cost  function  (2)  subject  to  dynamic  constraints  (1),  with 
time  of  intercept  given. 


dR/dt  =  V 
dV/dt  =-u 

where  : 

R  =  relative  position  vector  of  missile  from  target  in  inertial 

coordinates 

V  =  -Vp,  =  relative  velocity  vector  of  missile  from  target 

u  =  missile  control  acceleration  vector  (not  constrained  in 

direction  or  magnitude) 


(1) 


The  cost  function  ’  J’  is  given  by  (2): 


J  =  R(tf)-  R(tf)/2  +JB  u(t)  •  u(t)/2  dt 


(i.e.  terminal  miss  plus  integrated  cost  of  control  weighted  by  B ). 
Recall  from  references  [1  &3]  that  the  solution  to  the  optimal  guidance  problem  is: 

U  =  -  Atg[R  +  Vtg]  /(3B  +  tg3) 

where:  tg  =  (tj  •  t)  =  estimated  time-to-go  to  intercept  is  given 


(2) 


(3)  * 
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OPTIMAL  VERSUS  PROPORTIONAL  GUIDANCE 


It  is  well  to  note  that  the  dynamic  model  Equation  (1)  contains  no  provision  for  the  expected  missile 
boost  acceleration  and  drag.  Also  note  that  the  acceleration  control  vector  ’u’  is  in  the  plane  spanned 
by  (R,V)  vectors.  Since  the  missile  velocity  vector  ’V^,’  is  not  necessarily  in  the  (R,V)-plane,  the  control 
vector  ’u’  may  have  a  component  along  ’V^’.  In  other  words  axial  acceleration  is  commanded  which  is 
typically  not  usable. 


In  appendix  A  it  is  show  that  Equation  (3)  reduces  to  the  familiar  Proportional  Navigation  Guidance 
(PNG)  law  Eq.  (4),  if  (a)  ’tg’  is  estimated  by  range/range  rate,  and  (b)  use  of  control  effort  is  not 

penalized,  (B  =  0). 


u  =  -  A  a  Rp  (4) 

where 

b  =  angular  rotation  rate  of  the  line-of-sight 
vector  in  inertial  space  (**) 

R  =  missile  closing  rate  on  target 

A  =  guidance  gain  factor  equal  to  3; 

used  as  a  design  parameter  with  value  3  to  6 

p  =  unit  normal  to  line  of  sight  vector  in  the  (R,V)  plane 


•  See  Bryson  [1,  pp287;  Eq.  9.4.27]  The  equivalence  is  apparent  if  the  energy  coefficients  are 
equated  as  follows:  Cp  =  1 ,  and  Cg  =  0,  with  A  =  3. 

(*•)  This  concept  of  coordinate  free  vector  rotation  in  three-space  is  the  key  to  generalized 
Proportional  Navigation  concept.  To  appreciate  the  simplification  achieved  by  this  concept 
compare  the  results  developed  here  with  [7]. 


141 


In  other  words,  the  improved  guidance  performance  of  Eq.  (3)  is  the  result  of  including  the  effect  of 
expected  missile  axial  acceleration  in  the  time-to-go  estimation;  since  both  guidance  laws  (3)  and  (4) 
are  based  on  dynamic  constraint  models  that  do  not  include  anticipated  missile  accelerations.  Next  we 
develop  the  more  accurate  dynamic  constraint  model  that  includes  the  expected  missile  acceleration, 
which  together  with  the  cost  function  results  in  the  improved  guidance  law.introdueaef  above. 


NEW  OPTIMAL  CONTROL  LAW  FOR  ACCELERATING  MISSILES 


The  new  optimal  control  law  is  based  on  the  two  point  boundary  value  mathematics  [1]  as  above,  but 
using  a  more  complete  dynamic  constraint  model  as  derived  next.  Consider  the  inertial  acceleration 
equation  (1),  with  the  missile  inertial  acceleration  expressed  as  orthogonal  components  relative  to  the 
missile  velocity  vector  as  follows: 


dV^/dt  =  GxVn,  +  (5) 

/  V  boost/drag  acceleration  of 

y  ''  missile,  not  controllable 

/  control  acceleration  ’u’  normal  to  in  the 
plane  spanned  by  (Rs>V^)  where  the  Rg 

vector  is  from  the  missile  to  predicted 
intercept  point. 


Equation  (5)  constitutes  the  modifications  to  (1)  which  results  in  the  new  more  accurate  dynamic 
constraint  model  (6)  summarized  below. 


MISSILE  TARGET  DYNAMIC  MODEL  THAT  INCLUDES  EXPECTED  MISSILE  ACCELERATIONS 


The  new  dynamic  missile-target  engagement  model  is  given  by  Eq.(6). 


dR/dt  = 

V 

dV/dt  = 

•  a~  •  u 
m 

^m  *  ■ 

C  a  for  Ktjj 

missile  axial 
acceleration 

l-ap  fortMb 

boost  and  drag 

• 

0  = 

= 

inertial  turning  rate  of  missile 
velocity  vector 

> 


(6) 


where; 


a ,  Sq 


V 


m 


u 


tb 


missile  expected  boost  &  drag  acceleration 
along 

missile  velocity  vector  in  inertial  coordinates 

control  acceleration  applied  in  the  plane 
normal  to  the  missile  velocity  axis. 

(the  direction  is  defined  later  by  the  control  law) 

missile  boost  burn  out  time 


As  noted  above,  this  model  is  more  complete  than  past  dynamic  constraint  models  used  for  guidance 
law  derivation.  Selected  variables  are  illustrated  in  Figure  1 .  below. 


TARGET 


PREDICTED  INTERCEPT 
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NEW  OPTIMAL  CONTROL  LAW  SOLUTION 


The  non  linear  dynamic  constraints  are  changed  to  a  linear  time  varying  constraint  by  use  of  the 
assumed  separability  of  the  missile  axial  dynamics  from  the  control  dynamics,  see  Eq.  (6).  It  is 
convenient  to  formulate  the  ’zero  effort  predicted  miss  function’  normal  to  the  missile  velocity  vector  (*). 
The  other  inertial  coordinate  system  axis  is  along  the  missile  velocity  vector;  for  which  the  time  history 
of  the  missile  is  specified  by  the  expected  accelerations,  (see  time-to-go  calculation  in  Appendix  B).  In 
actual  implementation  of  the  guidance  law  the  missile  target  data  may  be  available  in  arbitrary 
coordinate  system  which  we  anticipate  by  our  use  of  coordinate  free  vector  notation. 


Consider  Eq.  (6)  with  the  inertial  axis  oriented  along  the  missile  velocity  axis  so  that  0  is  small.  In  this 
case  the  component  of  predicted  zero  effort  final  miss  orthogonal  to  the  missile  velocity  axis  is: 


z 


dt 


+  Zq  +  0  Rg  n 


(7) 


where: 

=  (Vj  •  n  )  n  =  component  of  target  velocity 
normal  to  missile  velocity  axis 


n  =  unit  normal  to  in  the  (Rg.Vj^)  plane 

Zq  =  (  R  •  n)n 

=  component  of  range  vector  (-R) 
normal  to  missile  velocity  axis 

0  =  0  =  current  attitude  of  in  the  Inertial  reference  frame 


Figure  1  is  a  summary  of  the  above  illustrating  the  variables  of  interest  for  this  guidance  law  derivation. 

Differentiating  Eq.  (7)  and  substituting  from  Eq.  (6)  yields  the  terminal  miss  dynamic  equation  (8)  in 
terms  of  the  control  vector: 

*  Past  guidance  law  predicted  miss  formulations  are  normal  to  the  line  of  sight  vector,  [1, 2,4,5];  which 
is  also  implicit  in  [3]  because  of  the  equivalence  to  PNG  presented  above. 
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Z  =  -U  Rg/V^  (8) 

where 

Rg  =  ■  i^a/2  +  (a  +  ap)  U  •Tx)Us(|  -T^  )/2  (9) 

=  the  predicted  path  length  traveled  by  the  missileto  intercept  in  terms  of  time-to-go  m 

=  the  time  between  target  intercept 

and  missile  boost  burn  out  time  ( =  tf-tjj) 

Ug  =  the  unit  step  function 

4  =  variable  of  integration  equal  to  ’tg' 

Using  the  familiar  methods  of  optimal  control  [1]:  we  obtain  the  control  in  terms  of  final  predicted  miss 

(z,)  : 

u  =  -(Zf/B)(Rg/Vn,)  (10) 

Equation  (10)  is  substituted  into  (8)  to  obtain  the  terminal  miss  differential  Equation; 

z  =  -(Zf/B)(Rg/Vj2  (11) 

With  some  effort  Equation  (11)  is  solved  ano  is  substituted  into  Eq.  (10)  to  give  the  desired  optimal 
guidance  law  Eq.  (12): 

(12) 
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Where  ; 


+  F(aD  ,V|.V,  ,Vb)  •  FOd  ,Vf,Vj  ,V{)  )Us(tt,-t) 


+  {  F(a^  .V,.Vj  ,V^)  -  F(a^,Vf,Vf,Vf)}U3(t-tb) 

The  function  ’D’  is  expressed  in  terms  of  the  unit  step  function  ’Ug(.)’  and  function  ’F’  which  is  defined 
as  follows: 

F(a,Vg,Bv.V)  = 

{  .v3/4  +  V^V^-  (Vj.2  +  B/2)V  •  (B/2)2  /V  }/a^ 


where: 


By  =  Vp  •  (a  +  aQ)a  Tj^ 

V{  =  estimated  final  velocity  of  the  missile 
Vjj  =  estimated  missile  velocity  at  burn  out 
Vg  =  Vf  +  (a  +  aQ)Tx 


It  is  well  to  note  that  the  above  guidance  law  reduces  to  the 

law  if  the  missile  axial  velocity  remains  constant  and  if  the  missile  axis  is  aligned  with  the  Ime-of-sight  to 
the  target.  The  scalar  function  ’D’  becomes  zero  for  constant  missile  velocity.  The  factor 
denotes  expected  missile  path  length  to  target  divided  by  missile  velocity  becomes  ’tg’,  the  time-to-go. 
And  the  remaining  zero  effort  miss  distance  ’z’  becomes  equal  to  the  product  of  Range,  angular  line  of 
sight  rate,  and  ’tg’  ,z  =  (a  R  tg). 
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SIMULATION  RESULTS 


To  illustrate  the  guidance  laws,  we  simulate  a  Surface-to-Air  (SAM)  engagement  scenario.  For  this 
guidance  law  performance  evaluation,  we  represent  the  "real  world  missile  by  Eg.  (1)  together  yyith  a 
first  order  lag  imposed  on  the  control  acceleration  of  0.3  sec;  and  a  maximum  limit  on  control  of  3g. 


THE  ENGAGEMENT  GEOMETRY  AND  RESULTS 


The  target  travels  at  a  constant  velocity  of  500  ft/sec,  and  is  crossing  initially,  at  vvhich  time  the  missile 
velocity  is  50  ft/sec:  see  Figure  1.  The  missile  accelerates  for  2  sec  at  20g  .  Figure  2  indicates  the 
benefit  of  the  more  complete  guidance  law,  Eq.  (12),  note  the  reduced  miss  distance.  Figure  3 
indicates  the  control  accelerations  associated  with  the  trajectories  indicated  in  Figure  (2);  note  that  the 
new  guidance  law  uses  less  control  effort  to  intercept  the  target  than  the  other  guidance  laws  (3)  and 
(4),  leaving  more  control  authority  to  counter  unexpected  target  maneuvers.  Recall  that  the  missile 
axial  velocity  is  assumed  not  controllable,  hence  a  component  of  the  commanded  acceleration  due  to 
Eq.  (4)  or  (5)  along  the  missile  velocity  vector  is  not  used. 

Recall  that  the  true  time  of  intercept  (tj)  is  unknown,  hence  various  algorithms  have  been  proposed  to 
estimate  tg  =  (tj-t),  the  remaining  time  to  intercept  [2,3,6]. 

Further  guidance  law  refinement  are  possible  if  we  include  the  effect  of  missile  auto  pilot  dynamic  delay 
into  Eq.  (6).  Another  improvement  is  the  inclusion  of  control  normal  to  the  body  axis  rather  than  normal 
to  the  velocity  vector  as  assumed  above. 

Of  potentialiy  greater  performance  benefit  would  be  the  combining  of  the  target  tracking  and  missile 
guidance  problems. 


CONCLUSION 


We  have  derived  a  new  guidance  law  for  missiles  experiencing  significant  axial  acceleration.  This 
guidance  law  is  demonstrated  to  give  superior  intercept  performance  compared  to  PNG  and  another 
"optimal"  guidance  law  in  a  surface-to-air  intercept  scenario.  This  guidance  law  is  obtained  through  the 
use  of  modern  control  methodology  applied  to  a  more  complete  kinematic  missile  target  engagement 
model.  The  kinematic  constraint  model  used  includes  the  missile  axial  accelerations  and  the  constraint 
on  the  direction  of  the  missile  control  effort  normal  to  the  velocity  vector.  The  resultant  non-linear 
dynamic  model  is  solved  by  first  solving  the  axial  dynamic  equation,  which  results  in  a  time  varyjng 
linear  dynamic  constraint  model.  Recent  "optimal"  guidance  laws  have  used  missile  acceleration 
estimates  in  the  estimation  of  the  "time-to-go"  to  intercept,  however  have  not  included  acceleration  in 
the  missile-target  dynamics  as  in  this  note. 


■The  constructive  comments  of  Prof.  J.  Bossi  is  appreciated.  And  the  help  of  Mr.  B.  Isham  in  obtaining 
the  results  of  Appendix  A  and  the  extensive  computer  simulations  is  greatfully  acknowledged.  Finally  I 
am  indebted  to  Mr.  A.  J.  Witsmeer  for  providing  the  support  that  made  this  study  possible. 
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BSTHM  1221DXXS  PNG,  *5nM»  SCEMfiPlO 
GN  =  3.0;  B  =  0.000;  R1  =  236.006  fiT  TI  =  2.580 


Figure  2. 

Relative  Trajectory  of  Missile  to  Target  Illustrating  Sensitivity  to  control  laws; 
Equations  (3),  (4)  and  (12),  resulting  in  terminal  miss  distances  of  18.5  ft,  288 
ft,  and  1.4  ft  respectively.  With  time-to-go  estimated  by  Eq.  (13). 
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Figure  3. 

Control  acceleration  normal  to  missile  velocity  axis;  note  that  less 
control  authority  is  used  with  new  guidance  law,  leaving  more  for 
countering  target  maneuvers.  Maximum  control  effort  is  limited  to  3 

g's. 
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APPENDIX  A  "OPTIMAL"  VS.  PROPORTIONAL  GUIDANCE 


The  optimal  guidance  law  of  Riggs  et.  al.  [3]  is  shown  to  be  equivalent  to  PNG  under  less  restrictive 
conditions.  The  conditions  for  the  equivalence  of  equations  (3)  to  equation  (4)  are:  (a)  the  time  to  go 
’tg’  intercept  is  range  divided  by  range  rate,  and  (b)  the  optimal  guidance  cost  function  assign  zero 
penalty  to  control  effort;  see  the  cost  function  Equation  (2). 


Consider  figure  Al ,  where  we  indicate  the  orthogonal  coordinate  system  with  one  axis  along  the  line  of 
sight  vector  ’R’,  and  the  normal  ’p’  to  ’R’  in  the  (R,V)-plane. 


Figure  Al  Vector  and  Scalar  quantities  for  three  dimensional  PNG 
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Consider  the  orthogonal  unit  basis  vectors  (r,p)  along  the  line  of  sight  vector  and  normal  in  the  (R,V)-plane 
as  indicated  in  Figure  A1 . 


It  is  convenient  to  define  the  following  : 

R  =  R  r  =  the  line  of  sight  vector  j 

V  =  Rr+VpP=  relative  velocity  ( V  (A1) 

tg  =  -R/R  time-to-go  to  intercept  j 

Substituting  definitions  A1  into  Eq.  (3)  with  B  =  0,  gives: 

u  =  -A  tg  [  R  +  (R  r  +  Vp  p)  tg  ]  /  tg3  (A2) 

u  =  -A  r2  [  R  r  +  R  (  -R/R)  r  +  Vp  (-R/R)  p  ]/  r2  (A3) 

•J  =  Vp  (R  /R  )  p  =  -A  R  CT  p  (A4) 


Where  the  concept  line  of  sight  vector  rotation  rate  in  inertial  space  is  represented  by  a.  Note  that  Equation 
A4  is  the  familiar  PNG  guidance  law  with  the  gain  factor  A  =  3;  typically  A  is  made  to  take  on  values 
between  3  and  6, in  order  to  more  quickly  head  the  missile  to  the  predicted  intercept  point.  Various  time-to- 
go  ’tg’  algorithms  have  been  developed  [2,3],  it  is  noted  that  the  use  of  smaller  than  true  ’tg’  improves 

performance,  and  is  equivalent  to  varying  A.  This  observation  is  indicative  of  the  inconsistency  of  using  the 
above  control  laws  (3,4)  in  a  situation  where  the  actual  missile  dynamics  does  ngi  have  axial  velocity 
control,  and  moreover  experiences  uncontrolled  boost  acceleration  to  ’mach’  and  drag. 
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APPENDIX  B  TIME-TQ-GO  TO  INTERCEPT  ESTIMATION 


The  estimation  of  the  time  of  target  intercept  is  motivated  by  the  subsequent  great  simplification  of  the 
general  (free  final  time)  two  point  boundary  value  problem  [1]. 


Consider  Equation  (6),  and  note  that  for  zero  control  effort,  the  missile  velocity  and  path  length  is  uniquely 
specified  by  (a)  the  initial  velocity,  (b)  the  expected  axial  acceleration,  and  (c)  the  time-to-go  to  intercept. 
Both  the  missile  velocity  and  the  remaining  path  length  are  expressed  in  terms  of  ’tg’,  hence  the  indicated 

solution  (B1 )  is  obtained. 

tg  =  [•  C.|  •  Sqrt(  C.|  ^  •  2  aQ  Cq  )  ]/  aQ  (B1 ) 


where  during  missile  boost;  or  t<  tjj  and  tj  >  tjj 


Cq  =  R  (a-aD)(tj,- 1)2/2 
=  dR/dt  +  (a-a0)(t^jt) 


and  for  after  burn  out  we  use;  (  t>  tjj,  tj  >  t|j ) 


Cg  =  R  :  and  C.|  =  dR/dt 


where: 


tjj  =  missile  boost  burn  out  time. 


For  convenience  we  repeat  a  time-to-go  estimation  algorithm  used  in  Reference  [3];  and  provide  a 
comparitive  illustration  in  Figure  B1. 
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Figure  B1 

Comparison  of  time-to-go  to  intercept  algorithms  in  a  tail  chase  scenario;  tg^ 
is  implicit  in  PNG,  tg3  is  from  Reference  [3],  and  tg^is  a  simplified  form  of 
Equation  (B1)  for  the  case  where  drag  acceleration  is  much  smaller  than  the 
boost  acceleration.  1 53 
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ABSTRACT 

Adaptive  Learning  Network  synthesis  has  been  used  to  develop  a 
new  class  of  guidance  laws  for  short-range  air-to-air  missiles. 
ALN  guidance  laws  are  trained  to  estimate  target  range,  range 
rate,  time-to-go,  and  other  trajectory  parameters  not  explicitly 
observable  by  passive  sensors  but  required  to  implement  advanced 
guidance  laws.  This  development  permits  economical  implementa¬ 
tion  of  modern  optimal  guidance  laws  in  passive  systems,  yielding 
considerable  performance  improvements  over  other  passively  real¬ 
izable  guidance  laws. 


INTRODUCTION 

Research  and  development  activities  directed  toward  improved 
guidance  laws  for  tactical  air-to-air  missiles  have  been  vigor¬ 
ously  pursued  over  the  last  thirty  or  more  years  [1]  .  The  most 
impressive  improvements  have  come  from  guidance  laws  developed 
using  optimal  control  theory  [2,3,5] .  Unfortunately,  the  best 
performing  of  the  modern  guidance  laws  require  knowledge  of  the 
relative  missile-target  position,  velocity,  and  acceleration. 
Typically,  air-to-air  missiles  are  equipped  only  with  passive 
sensor  systems  (infrared  seekers)  so  straightforward  implementa¬ 
tion  of  modern  guidance  laws  is  not  possible.  Additionally,  most 
optimal  guidance  laws  also  require  an  estimate  of  time-to-go. 

The  traditional  solution  to  the  problem  of  implementing  modern 
guidance  laws  when  faced  with  limited  information  has  made  use  of 
Kalman  filters  to  estimate  the  unknown  states  [4] .  This  approach 
requires  that  the  missile  be  equipped  with  fairly  sophisticated 
data  processors,  and  does  not  solve  the  problem  of  estimating 
time-to-go. 

Recently,  a  new  approach  to  implementing  modern  guidance  laws  in 
passive  systems  has  been  developed  using  Adaptive  Learning  Net¬ 
works  (ALNs)  to  estimate  the  unknown  combinations  of  data  [6] . 
In  this  approach,  ALNs  are  "trained"  to  mimic  the  modern  guidance 


*Work  supported  by  the  Department  of  the  Air  Force,  Armament 
Division,  Eglin  Air  Force  Base,  Florida,  under  Contract  F08635- 
79-C-0220. 
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law,  using  only  passively  available  information.  Because  the 
ALNs  are  trained  from  a  library  of  simulated  engagements,  it  is 
now  possible  to  implement  a  wide  variety  of  guidance  laws  that 
are  not  amenable  to  Kalman  filter  approaches.  For  instance,  non- 
deterministic  or  iterative  solutions  can  be  implemented  once  the 
appropriate  training  data  base  is  obtained. 

The  use  of  trainable  ALN  guidance  laws  also  has  a  considerable 
impact  on  hardware  implementation  requirements.  The  trained  ALN 
algorithms  are  a  computationally  simple  set  of  algebraic  expres¬ 
sions.  They  can  be  realized  using  simple  microprocessor  cir¬ 
cuitry. 


ADAPTIVE  LEARNING  NETWORK  SYNTHESIS  METHODOLOGY 

The  classical  approach  in  the  design  of  signal  processing  func¬ 
tions  has  been  to  determine  explicitly  all  of  the  relevant  char¬ 
acteristics,  deterministic  and/or  statistical,  of  the  process 
being  observed  and  to  use  these  measurements  with  simplifying 
assumptions  in  the  design  synthesis.  Often  the  mathematical 
structure  of  the  processor  is  assumed  and  its  design  consists  of 
calculating  the  values  of  the  coefficients  in  this  structure.  In 
many  applications,  however,  little  is  known  about  the  character¬ 
istics  of  the  structure  and  the  best  or  even  an  acceptable  struc¬ 
ture  for  the  process  cannot  be  determined  a  priori.  In  these 
cases,  it  is  desirable  to  determine  the  model  structure  as  well 
as  the  model  coefficients  from  a  representative  data  base.  The 
Adaptive  Learning  Network  (ALN)  methodology  [7,8,9]  provides  a 
realization  of  this  goal.  Many  useful  references  are  found  in 

Reference  10. 

To  explain  the  ALN  approach,  let  us  assume  that  we  observe  a 

scalar  variable  y,  called  the  output,  and  N  other  variables,  Xt, 
Xo,  ...r  Xvj  which  are  called  inputs  (in  the  present  case,  the 
desired  output  is  a  command  to  a  missile  control  system;  the 

independent  variables  are  data  available  from  the  sensor  sys¬ 
tem)  .  The  inputs  are  also  referred  to  as  the  observables  or 
independent  variables  and  the  output  as  the  dependent  variable. 
Here,  "independent"  means  independently  observed;  the  input 
variables  need  not  be  statistically  independent.  We  seek  a 

relationship  between  the  independent  and  dependent  variables 
that,  in  general,  is  a  nonlinear  function; 

y  =  f(x2^,  X2,  ...,  C2,  ..♦,  Cj^)  (1) 


Here  and  elsewhere,  the  caret  is  used  to  denote  the  calculated 
value  of  the  indicated  variable.  If  f  is  known,  then  estimates 
of  the  L  coefficients  (c^^,  C2,  •••/  obtained  by 

minimizing  M  differences  between  the  calculated  and  observed 
dependent  variables  in  the  least-squares  sense. 
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In  many  applications,,  however,  the  function  f  is  not  known  and 
simplified  analytical  models  fail  to  reflect  the  complexity  of 
the  observed  data.  In  these  cases,  the  ALN  methodology  provides 
an  empirical  solution  to  the  structure  of  f  based  on  a  network 
interconnection  of  basic  elements  whose  individual  structures,  as 
well  as  the  interconnections,  are  learned  from  a  representative 
data  base. 


Under  fairly  general  conditions,  a  function  of  N  variables  (x^^, 
X2,  ...,  Xjj)  may  be  expressed  as  a  power  series. 
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This  polynomial  expression  in  many  variables  is  referred  to  as  a 
multinomial  expression  or  a  Kolmogorov-Gabor  polynomial  [11].  It 
is  extremely  difficult  to  evaluate  the  coefficients  of  Kolmo¬ 
gorov-Gabor  polynomials  when  there  are  more  than  several  vari¬ 
ables.  The  evaluation  is  considerably  easier  when  the  variables 
are  introduced  successively,  which  can  be  done  in  pairs  using 
second-order  partial  polynomials,  also  called  basic  elements: 

^  2  2 

y^k  ”  '^O  '^l^j  '^2*k  '^S^k 


Here  y.,  is  used  to  denote  the  output  of  the  basic  element  to  in¬ 
puts  x^^and  Xj,,  and  the  Wj^'s  (i=0,l,  ...,  5)  are  the  coefficients 
of  the'^  basic  element  whose  values  are  determined  by  the  least- 
squares  error  criterion. 


Training  of  the  ALN  (ALN  synthesis)  consists  of  building  up, 
layer  by  layer,  a  structure  that  synthesizes  the  Kolmogorov-Gabor 
polynomial  by  using  basic  elements  y.;]^.  In  the  first  layer  of 
the  network,  N(N-l)/2  basic  elements  ^are  constructed  from  the  N 
inputs.  Some  of  the  best  performing  input  variables  are  also 
used  to  form  triplets  and  third-order  partial  polynomials.  Only 
basic  elements  producing  acceptable  model  errors  are  allowed  to 
pass  their  outputs  to  the  second  layer.  The  outputs  of  each 
surviving  basic  element  as  well  as  the  original  features  are  used 
as  inputs  to  the  second  layer  of  basic  elements.  This  process 
can  be  repeated  with  succeeding  layers  until  overfitting  is 
detected. 


159 


physical,  and  engineering  processes  [10] .  Application  of  the  ALN 
methodology  has  been  successful  where  conventional  and/or  theo¬ 
retical  modeling  techniques  have  produced  poor  results. 


APPLICATION  TO  GUIDANCE  LAW  SYNTHESIS 

To  synthesize  an  ALN  guidance  law,  the  ALN  is  trained  on  example 
guidance  commands  for  a  variety  of  typical  engagements.  This  is 
accomplished  by  building  a  training  data  base:  an  advanced  guid¬ 
ance  law  (usually  a  law  requiring  complete,  explicit  observabili¬ 
ty  of  the  target's  trajectory)  is  used  in  conjunction  with  a 
computer  program  that  simulates  air-to-air  engagements,  and  the 
analyst  compiles  a  library  of  typical  simulated  engagements.  The 
library  ideally  encompasses  the  range  of  launch  conditions  over 
which  the  resulting  guidance  law  is  expected  to  perform.  For  ex¬ 
ample,  launch  range,  initial  aiming  error  (of f-boresight  angle), 
and  the  target  aspect  angle  are  important  launch  parameters.  At 
each  time  step  of  each  simulated  engagement,  outputs  available 
from  the  target  seeker  and  other  sensors  in  the  missile  are  tabu¬ 
lated  along  with  the  actual  acceleration  commands  generated  by 
the  advanced  guidance  law.  The  target  seeker  and  other  sensors 
in  the  missile  will,  in  general,  deny  explicit  observability  of 
some  quantities  used  in  computing  the  advanced  guidance  law  com¬ 
mands,  but  the  latter  is  computed  using  all  denied  information 
when  generating  the  training  data  base.  The  time  histories  of 
the  available  sensor  observables  are  used  as  input  variables  in 
the  ALN  synthesis  process,  while  the  commands  are  the  dependent 
variables.  In  this  way,  the  sensor  data  are  used  to  estimate  the 
information  not  explicitly  observed  and  to  generate  the  appropri¬ 
ate  guidance  commands  as  if  that  information  were  available.  The 
resulting  ALNs  embody  an  economical  realization  of  the  advanced 
guidance  law. 


ILLUSTRATIVE  EXAMPLE 

As  an  example  of  ALN  implementation  of  a  guidance  law,  let  us 
consider  implementation  of  proportional  navigation  (PNG).  In 
this  example,  as  in  many  tactical  missile  applications,  the  pri¬ 
mary  (and  perhaps  only)  explicitly  observable  external  quantity 
is  the  line-of-sight  angle  (X)  or  its  rate  of  change  (X).  (Note 
that  although  we  are  limiting  this  illustrative  example  to  a  two- 
dimensional  engagement  for  which  there  is  only  one  line-of-sight 
angle,  the  extension  to  three  dimensions  is  straightforward.) 
Other  variables  and/or  constants  of  the  engagement  kinematics  are 
usually  unknown,  although  explicit  identification  of  some  of  them 
is  sometimes  possible  through  the  application  of  techniques  of 
modern  control  theory. 

The  limited  information  in  this  situation  largely  accounts  for 
the  popularity  of  PNG,  wherein  the  rate  of  change  of  X  is  mea¬ 
sured  and  the  commanded  missile  normal  acceleration  (ajj,j^)  is  set 
proportional  to  this  rate: 
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Avoidance  of  overfitting  is  a  key  aspect  in  the  training  of  ALNs 
[12,9].  The  network  must  be  taught  to  generalize  properly  on  its 
experience  in  fitting  the  training  observations  so  that  error 
rates  for  new,  but  statistically  similar  data,  will  also  be  low. 
Overfitting  is  detected  by  cross-validation  and  by  employing  an 
information  theoretical  criterion  [13] .  When  cross-validation  is 
used  to  detect  overfitting,  the  known  data  base  is  divided  into 
two  independent  but  statistically  similar  subsets  called  training 
and  selection  subsets.  The  training  subset  is  used  to  determine 
the  coefficients  of  the  elements.  The  selection  subset  is  used 
to  reject  the  poorly  performing  basic  elements  and  to  detect 
overfitting.  While  the  error  rate  on  the  training  subset  is 
continuously  decreased  by  incorporating  additional  layers,  the 
error  rate  on  the  selection  subset  increases  when  overfitting 
occurs. 

When  the  number  of  observations  in  the  data  base  is  too  small  to 
form  the  two  independent  subsets,  the  growth  of  the  model,  i.e., 
the  increase  in  the  number  of  coefficients,  is  controlled  by  the 
use  of  Akaike's  information  criterion  (AIC)  [14].  The  AIC 
measures  the  poorness  of  the  model  and  consequently  needs  to  be 
minimized: 


AIC  =  -2Jln(maximum  likelihood)  (5) 

+2 (number  of  coefficients)  =  minimum 

The  AIC  can  be  considered  as  an  adaptive  F  test  where  the  risk 
level  changes  with  the  number  of  observations  and  the  number  of 
model  coefficients  in  the  two  models  to  be  compared  [15] .  Also, 
the  AIC  is  asymptotically  equivalent  to  the  maximum  likelihood 
model  [13];  i.e.,  the  model  selected  by  the  independent  selection 
subset  based  on  the  least-squares  error  criterion  approaches  the 
model  selected  by  the  AIC  as  the  number  of  observations  ap¬ 
proaches  infinity. 

If  the  size  of  the  data  base  permits,  an  independent  evaluation 
subset  is  used  to  estimate  the  overall  performance.  Since  the 
evaluation  subset  is  not  used  for  network  synthesis,  the  perform¬ 
ance  of  this  subset  is  an  accurate  estimate  of  the  ability  of  the 
network  to  generalize  to  new,  previously  unseen  data. 

In  summary,  the  ALN  method  is  an  empirical  technique  to  obtain 
the  structure  of  a  process  and  requires  no  a  priori  knowledge 
and/or  assumptions  about  the  process  itself.  The  relevant  fea¬ 
tures  are  selected  by  the  learning  algorithm  from  the  candidate 
feature  list  and  are  introduced  into  the  learning  network  in  the 
optimum  order.  Features  discovered  by  the  learning  algorithm  to 
be  of  little  or  no  use  are  discarded  automatically.  The  learning 
algorithm  permits  the  structure  of  the  network  to  grow,  i.e.,  to 
approximate  the  general  Kolmogorov-Gabor  polynomial,  but  only  to 
the  extent  required  by  the  data  base.  As  a  general  modeling  tool, 
the  ALN  methodology  is  applicable  to  detection,  classification, 
prediction,  and  control  of  a  wide  range  of  complex  biological. 
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physical,  and  engineering  processes  [10] .  Application  of  the  ALN 
methodology  has  been  successful  where  conventional  and/or  theo¬ 
retical  modeling  techniques  have  produced  poor  results. 


APPLICATION  TO  GUIDANCE  LAW  SYNTHESIS 

To  synthesize  an  ALN  guidance  law,  the  ALN  is  trained  on  example 
guidance  commands  for  a  variety  of  typical  engagements.  This  is 
accomplished  by  building  a  training  data  base:  an  advanced  guid¬ 
ance  law  (usually  a  law  requiring  complete,  explicit  observabili¬ 
ty  of  the  target®s  trajectory)  is  used  in  conjunction  with  a 
computer  program  that  simulates  air-to-air  engagements,  and  the 
analyst  compiles  a  library  of  typical  simulated  engagements.  The 
library  ideally  encompasses  the  range  of  launch  conditions  over 
which  the  resulting  guidance  law  is  expected  to  perform.  For  ex¬ 
ample,  launch  range,  initial  aiming  error  (of f-boresight  angle), 
and  the  target  aspect  angle  are  important  launch  parameters.  At 
each  time  step  of  each  simulated  engagement,  outputs  available 
from  the  target  seeker  and  other  sensors  in  the  missile  are  tabu¬ 
lated  along  with  the  actual  acceleration  commands  generated  by 
the  advanced  guidance  law.  The  target  seeker  and  other  sensors 
in  the  missile  will,  in  general,  deny  explicit  observability  of 
some  quantities  used  in  computing  the  advanced  guidance  law  com¬ 
mands,  but  the  latter  is  computed  using  all  denied  information 
when  generating  the  training  data  base.  The  time  histories  of 
the  available  sensor  observables  are  used  as  input  variables  in 
the  ALN  synthesis  process,  while  the  commands  are  the  dependent 
variables.  In  this  way,  the  sensor  data  are  used  to  estimate  the 
information  not  explicitly  observed  and  to  generate  the  appropri¬ 
ate  guidance  commands  as  if  that  information  were  available.  The 
resulting  ALNs  embody  an  economical  realization  of  the  advanced 
guidance  law. 


ILLUSTRATIVE  EXAMPLE 

As  an  example  of  ALN  implementation  of  a  guidance  law,  let  us 
consider  implementation  of  proportional  navigation  (PNG).  In 
this  example,  as  in  many  tactical  missile  applications,  the  pri¬ 
mary  (and  perhaps  only)  explicitly  observable  external  quantity 
is  the  line— of  —  sight  angle  (X)  or  its  rate  of  change  (X).  (Note 
that  although  we  are  limiting  this  illustrative  example  to  a  two- 
dimensional  engagement  for  which  there  is  only  one  line-of- sight 
angle,  the  extension  to  three  dimensions  is  straightforward.) 
Other  variables  and/or  constants  of  the  engagement  kinematics  are 
usually  unknown,  although  explicit  identification  of  some  of  them 
is  sometimes  possible  through  the  application  of  techniques  of 
modern  control  theory. 

The  limited  information  in  this  situation  largely  accounts  for 
the  popularity  of  PNG,  wherein  the  rate  of  change  of  X  is  mea¬ 
sured  and  the  commanded  missile  normal  acceleration  (^nin^  set 
proportional  to  this  rate: 
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where  Vjjj  is  missile  velocity  and  N  is  the  so-called  navigation 
constant. 

In  the  simplified  example  at  hand,  the  analyst  could  postulate  a 
family  of  intercept  engagements,  the  family  being  characterized 
by  a  given  value  of  the  navigation  constant,  N  =  N'.  Each 

member  of  this  family  would  have  its  unique  combination  of  values 
for  the  initial  conditions  and  constants.  Literally,  thousands 
of  engagements  could  be  "run  off"  on  the  computer,  and  the 
results  could  be  put  into  a  data  base,  shown  schematically  in 
Figure  1. 
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M  =  number  of  observed  points  in  data  base. 

FIGURE  1;  EXAMPLE  DATA  BASE  , 

The  ALN  training  procedure  could  be  used  to  synthesize  a  small 
network  in  which  the  output  (Y)  would  be  a^^^  and  the  inputs  (X-|l 
and  X2)  would  be  t  and  X.  At  the  conclusion  of  this  training,  one 
would  undoubtedly  find  that  the  ALN  is  one  which  generates  a 
close  approximation  to  the  original  guidance  law.  Equation  (6). 
Examination  would  show  that  the  output  of  the  ALN  and  from 

Equation  (6)  agree  fairly  closely  for  all  values  of  t  ana  X  in 
the  data  base.  In  this  case,  the  ALN  methodology  has  "discov¬ 
ered"  the  value  of  N^  used  to  generate  the  family  of  solutions. 
More  importantly,  the  methodology  has  "discovered"  a  way  to  infer 
the  unknown  (unknown  because  it  was  assumed  to  be  explicitly 
unobservable  within  the  missile  system)  .  Even  though  Vj^  varied 
with  time  and  was  different  —  in  general  —  for  each  solution  in 
the  data  base,  and  not  an  explicit  input  to  the  ALN,  the  proce¬ 
dure  for  ALN  synthesis  created  an  approximate  relationship  for 
its  identification. 
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Per'haps ^  after  creating  the  above  ALN  function^  it  would  be  found 
that  the  ALN  solution  is  not  a  sufficiently  close  approximation 
to  Eguation  (6) .  In  this  case^  a  more  complete  record  of  the 
histories  of  X  and  might  be  used  to  provide  addditional  input 

information  during  ALN  training.  The  results  of  the  engagement 
simulations  could  be  put  into  data  base  records,  each  record 
being  of  the  form; 

Y^it)  ,  y^(t-At),  ...»  yj^(O), 

X2^(t)f  X2j^(t"*At),  ...r  ^ 

where  At  is  a  constant  sampling  interval.  In  other  words, 
samples  of  X  would  be  kept  from  ^  the  time  of  missile 

launch  until  tne  time  of  "present"  calculation,  t. 

We  do  not  know,  a  priori,  how  far  back  in  time  the  guidance  law 
might  best  go  in  fetching  inputs  from  such  data  files  as  have 
just  been  described.  But,  no  matter.  The  ALN  methodology  will 
find  this  for  us,  using  only  the  most  relevant  samples  in  gener¬ 
ating  the  desired  approximation.  From  a  practical  standpoint,  we 
may  wish  to  enforce  a  limit  on  the  amount  of  memory  available  for 
the  guidance  system,  in  which  case,  the  data  records  could  be 
constructed  in  the  form: 

y^(t),  y^(t-At),  ...,  y^(t-kAt), 

X2i(t),  X2^(t-At),  ...,  X2j^{t-kAt) ;  t 

where  k  is  an  integer  (say  5  or  10)  .  At  the  beginning  of  the 
intercept  mission,  the  observed  values  of  yi(0)  and  X2i(0)  could 
be  loaded  into  appropriate  memory  locations  within  the  missile 
guidance  unit,  and  after  collecting  k  consecutive  samples,  the 
ALN  solution  would  be  fully  "initialized." 


ALN  SYNTHESIS  FOR  AN  OPTIMAL  GUIDANCE  LAW 

The  basic  principles  of  guidance  law  synthesis  using  ALN  training 
outlined  for  the  illustrative  example  have  been  applied  to  a 
guidgnce  law  (MG)  derived  using  known  optimal  control  theory  [2] . 
The  optimal  guidance  command  is; 

I  =  ^  ^  t 

go 

where  a  is  the  optimal  missile  acceleration  measured  in  an 
inertial  reference  frame,  S  is  the  missile— target  line— of— sight 
(LOS)  vector,  R  is  the  time  rate  of  change  of  the  LOS  vector, 
and  t_„  is  the  remaining  time  to  go  until  intercept.  In  terms 
of  thi°relative  range,  R,  range  rate,  R  and  the  line-of-sight 
angles  X„  and  X  depicted  in  Figure  2,  the  vectors  are; 

A. 
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If  the  subject  missile  is  equipped  with  a  gimballed,  inertially 
stabilized  IR  seeker,  the  inertial  LOS  angles  and  LOS  rates  are 
readily  available.  Then  acceleration  commands  given  by  Equation 
(7)  and  (Sa-f)  can  be  calculated  in  the  (inertial)  seeker  refer¬ 
ence  frame.  Control  of  the  missile  requires  that  acceleration 
commands  be  specified  relative  to  a  coordinate  system  fixed  to 
the  missile  body  axes.  This  is  easily  accomplished  if  the  orien¬ 
tation  of  the  seeker  is  known.  In  practice,  the  longitudinal 
acceleration  of  the  missile  is  often  uncontrollable  due  to  the 
type  of  propulsion  system  that  is  used.  In  these  instances,  aj*„ 
is  unrealized  and  a^^y  and  are  calculated  using  Equations  (7) 

and  (8a-f) .  These  equations  express  the  acceleration  in  terms  of 
the  LOS  angles  and  LOS  rates  typically  encountered  in  PNG  appli¬ 
cations. 


FIGURE  2;  MISSILE-TARGET  LINE-OF-SIGHT  GEOMETRY 
IN  INERTIAL  COORDINATES 
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The  missile  acceleration  commands  in  the  seeker  inertial  refer¬ 
ence  frame  can  be  written  [6]  : 


A 

=  K, cosa  cosa 

sx 

1  q 

A 

=  K-cosa  sina 

sy 

1  q 
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The  orientation  of  the  seeker,  relative  to  the  missile  body  axes, 
is  specified  by  two  angles,  ij;  and  0  .  The  transformation  be¬ 
tween  the  missile  body  coordinates  ana  seeker  coordinates  may  be 
written  as: 
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where  X^^and  are  measured  in  the  body  and  seeker  frames, 
respectively. 


For  the  gimballed,  initially  stabilized,  IR  seeker,  the  passively 
unknown  data  are  completely  contained  in  the  optimum  gain 
terms,  k,  and  k_.  Thus,  if  estimates  of  these  values  can  be 
obtained"^ from  the  passively  observable  data,  a  passive  implemen¬ 
tation  of  the  modern  guidance  law  can  be  obtained. 


Several  approaches  to  estimating  tg^  exist, 
deterministic  approach  to  the  estimation  of 
suited  to  generating  training  acceleration 
line  mode.  The  procedure  is: 


An  iterative,  non- 
tgQ  is  particularly 
ccmmands  in  an  off- 


(a)  Simulate  the  engagement  using  the  complete  MG  equations 
with  a  typical  calculation  of  tgQ  [2]. 

(b)  Save  the  duration  of  the  engagement,  t^. 

(c)  Simulate  the  engagement  again  as  in  (a),  except  use  tg^ 
=  tf-t,  where  t  is  the  elapsed  time  past  launch. 

(d)  Save  the  corrected  duration  of  the  engagement,  tf» 
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(e)  Repeat  (c)  and  (d)  until  tj  converges,  whence  t^^  is 

known  exactly  throughout  the  engagement. 

Engagements  produced  by  this  procedure  are  more  nearly  optimal 
than  those  that  make  use  of  estimates  of  tg^. 

The  ALN  training  approach  to  synthesizing  the  modern  guidance  law 
for  the  inertially  stabilized  gimballed  IR  seeker  consists  of  the 
following; 

(a)  Acquire  time  histories  of  the  passive  observables. 

(b)  Use  the  data  from  (a)  to  estimate  the  optimum  gains, 

and  K^, 

(c)  Use  the  estimates  of  k,  and  together  with  Equations 
(9)  and  (11)  to  calculate  tne  required  accelerations 
relative  to  the  missile  body  coordinate  system. 

Note  that,  although  R,  R,  and  t-Q  are  required  to  implement  MG, 
only  two  combinations  of  these^  quantities  in  the  form  of  the 
optimal  gains,  k,  and  are  required.  Thus,  the  ALNs  can  be 

trained  to  estimate  the  required  optimal  gain  rather  than  to 

estimate  the  three  separate  values. 


Training  Data  Base 

To  demonstrate  the  application  of  this  method,  a  training  data 
base  for  the  modern  guidance  law  was  obtained  by  simulating  air- 
to-air  engagements  using  a  six-degree-of- freedom  (6D0F)  simula¬ 
tion  program  provided  by  the  Air  Force  Armament  Laboratory.  The 
6D0F  program  implemented  MG  for  a  realistic  model  of  a  highly 
measurable,  short-range,  bank-to-turn  missile.  An  evasive  target 
maneuver  is  also  included  in  the  simulation  program. 

Simulated  engagements  were  run  for  a  variety  of  initial  engage¬ 
ment  conditions.  For  each  set  of  launch  conditions,  the  simula¬ 
tion  was  iterated  to  achieve  the  best  estimate  of  t-^.  For  the 
final  iteration,  values  for  each  of  the  passive  H^servables, 
together  with  the  optimal  guidance  commands  and  k,  and  k^,  were 
saved  in  a  training  data  file.  The  training  data  base  was 
limited  in  this  example  to  the  following  launch  conditions; 

-  Missile  and  target  were  co-speed,  flying  straight  and 
level  at  missile  launch  (0.9  mach) . 

Missile  and  target  were  co-altitude  at  launch  (10,000 
feet)  . 

Initial  aspect  angle  (angle  between  target  velocity 
vector  and  LOS  vector)  varied  from  0  to  180  in  45 
increments.  (Zero-degree  aspect  angle  is  tail-on.) 
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Initial  of f-boresight  angle  (angle  between^  missile 
velocity  vector^  and  LOS  vector)  was  either  25  leading 
or  lagging  or  0  . 

-  The  target  flew  straight  and  level  until  missile  range 
was  less  than  6000  feet,  at  which  time  the  target  per¬ 
formed  a  9— g ,  dual— plane  maneuver  into  the  attack. 

ALN  Training 

Approximately  1500  time  points  from  a  total  of  200  training  en¬ 
gagements  were  used  to  train  ALNs  to  estimate  the  optimal  values 
of  K  and  <2  passively  available  data.  Candidate  ALN 

inputs  included: 

o  Missile-Target  LOS  angles  in  Missile  Body  Coordinates: 
ET,AT 

o  Seeker  Gimbal  Orientation  angles: 

o  Missile  Target  LOS  rates  determined  by  the  Seeker: 


o  Time  since  launch:  t,  1/t  (t  >  0) 

For  each  of  the  angle  variables,  the  current  and  four  previous 
samples  of  each  variable  were  saved  for  training  (At  =0.05  sec). 

The  structure  of  the  trained  ALN  that  estimates  the  optimal 
gains,  k,  and  k,,  is  shown  in  Figure  3.  Each  element  of  the 
ALN  calculates  up  to  a  cubic  polynomial  of  its  inputs.  The  ALN 
estimator  is  a  complicated  function  of  the  candidate  input 
variables,  but  not  all  of  the  candidate  inputs  are  used.  Those 
inputs  that  provided  redundant  or  unnecessary  data  were  discarded 
by  the  ALN  training  routine. 

ALN  Guidance  Performance 

The  performance  of  the  ALN  guidance  law  was  determined  by  using 
it  to  guide  the  hypothetical  missile  in  simulated^  engagements. 
The  engagement  launch  conditions  were  generally  different  from 
those  used  in  training  the  ALN.  Two  comparisons  of  the  ALN 
guidance  law  have  been  carried  out; 

o  comparison  of  passive  ALN  guidance  with  the  full  modern 
guidance  law  (assuming  observability  of  all  states) ,  and 

o  comparison  of  ALN  guidance  with  PNG.  This  comparison 
shows  the  level  of  performance  improvement  that  can  be 
obtained  over  current  systems. 

Initial  evaluation  of  the  passive  ALN  guidance  laws  was  carried 
out  by  comparing  ALN  guidance  with  the  active  modern  guidance  law 
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FIGURE  3:  ALN  STRUCTURE  FOR  MODIFIED  PROPORTIONAL  NAVIGATION 
FOR  A  GIMBALLED  SEEKER  SYSTEM:  ALNGl 


for  the  same  launch  conditions  included  in  the  training  data 
base.  Defining  a  hit  as  an  engagement  for  which  the  terminal 
miss  distance  was  less  than  10  feet,  the  ALN  guidance  law  scored 
hits  for  82%  of  the  engagements  hit  by  active  modern  guidance. 
ALN  guidance  was  generally  inferior  to  MG  for  longer-range 
launches  that  generally  terminate  in  a  tail  chase.  A  linear 
combination  of  PNG  with  ALN  guidance  diagrammed  in  Figure  A,  was 
therefore  used  to  slowly  turn  off  ALN  guidance.  ALN-PNG  guidance 
was  successful  for  91%  of  the  engagements  hit  using  MG  .  Addi¬ 
tionally,  20%  of  the  engagements  not  hit  using  MG  were  hit  by  the 
combined  ALN-PNG  guidance  law. 

A  complete  comparison  of  the  ALN-PNG  guidance  law  with  MG  and 
with  PNG  was  obtained  for  266  engagement  conditions,  including 
of f-boresight  angles  up  to  ±45  .  Note  that  the  training  data 

included  only  of f-boresight  angles  to  ±25  .  The  detailed 

comparison  is  presented  in  Reference  6;  the  important  trends 
noted  are  summarized  here. 

First,  the  ALN-PNG  guidance  law  generally  achieved  intercept  with 
smaller  terminal-miss  distances,  slightly  shorter  flight  times, 
and  less  total  guidance  impulse  than  did  PNG.  Most  importantly, 
the  ALN-PNG  guidance  law  produced  successful  intercepts  more 
often  than  PNG.  This  was  especially  true  for  launches  with  lar¬ 
ger  aspect  angles  (  > 90°)  and  larger  of f-boresight  angles  (>20  , 
especially  noticeable  for  OBA  =  ±  45  ).  While  providing  a  real¬ 
izable  implementation  of  modern  guidance,  the  ALN-PNG  guidance 
laws  also  produced  successful  intercepts  over  a  considerably 
wider  range  of  launch  parameters  than  PNG. 

Summary  performance  matrices  are  presented  in  Figures  5  and  6. 
The  performance  evaluation  was  carried  out  for  the  ALN-PNG 
guidance  law.  It  can  be  readily  ascertained  that  the  passive 
combined  ALN-PNG  guidance  law  was  greatly  superior  to  PNG.  It 
had  19%  more  hits  when  the  initial  OBA  was  0  and  ±15  ,  35%  more 
hits  when  the  initial  OBA  was  ±25  ,  and  61%  more  hits  when  the 
initial  OBA  was  ±45  . 

On  comparing  the  passive  (ALN-PNG)  guidance  law  with  the  active 
modern  guidance  law.  Figure  6,  there  were  a  number  of  engagements 
(13)  hit  by  the  ALN-PNG  law  that  were  missed  by  MG,  while  only 
one  engagement  missed  by  the  ALN-PNG  law  was  hit  by  the  MG.  This 
shows  again  that  the  ALN-PNG  guidance  law  was  an  excellent  repre¬ 
sentation  of  MPN  over  the  envelope  of  tested  launch  conditions. 
Further,  the  engagements  in  which  the  ALN  guidance  hit  and  the 
MPN  missed  were  engagements  near  the  outer- launch  body  of  the 
missile.  This  indicated  that  the  phasing  in  of  PNG  in  the  ALN 
provided  a  guidance  law  that  was,  in  some  respects,  better  than 
MG.  Additional  independent  testing  of  the  ALN-PNG  guidance  law 
showed  that  dramatic  increases  are  obtained  in  the  outer  launch 
boundaries  over  those  available  using  either  PNG  or  MG. 

Additional  testing  has  compared  the  ALN-PNG  guidance  law  with  an 
extended  Kalman  filter  implementation  of  MG^  In  both  cases. 
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FIGURE  4: 


COMBINATION  OF  ALN  GUIDANCE  LAW  WITH  CLASSICAL 
PROPORTIONAL  NAVIGATION 
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FIGURE  5:  COMPARISON  OF  THE  COMBINED  ALN-PNG  GUIDANCE  LAW  WITH  CLASSICAL 
PASSIVE  PNG  FOR  TEST  PLAN  ENGAGEMENTS 
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FIGURE  6:  COMPARISON  OF  THE  COMBINED  ALN-PNG  GUIDANCE  LAW  WITH 
ACTIVE  MODERN  GUIDANCE  LAW  FOR  TEST  PLAN  ENGAGEI^ENTS 


realistic  noise  models  were  used  to  degrade  the  sensor  noise. 
These  tests  showed  that  ALN-PNG  produced  better  inner-launch 
boundaries  than  PNG.  MG  produced  better  inner-launch  boundaries 
than  either  ALN-PNG  or  PNG,  but  ALN-PNG  produces  the  best  outer- 
launch  boundaries.  In  the  presence  of  noisy  sensor  data,  the 
performance  of  ALN-PNG  was  not  greatly  degraded. 


CONCLUDING  REMARKS 

The  ALN  guidance  law  synthesis  approach  permits  the  use  of  ad¬ 
vanced,  "active"  guidance  laws  with  seeker  hardware  of  the  type 
normally  used  to  implement  classical,  passive  proportional  navi¬ 
gation.  In  their  explicit  deterministic  forms,  the  advanced 
guidance  laws  require  knowledge  of  range,  range  rate,  and  time- 
to-go,  in  addition  to  the  usual  LOS  angular  rates.  By  estimating 
gain  factors  involving  range,  range  rate,  and  time-to-to,  the 
ALNs  permit  implementation  of  the  advanced  laws  using  only  pas¬ 
sive  observables. 

The  computations  required  by  ALN  guidance  laws  could  be  implemen¬ 
ted  by  adding  a  small  microprocessor-based  subsystem  to  the 
passively-guided  missile.  The  microprocessor  would  be  supplied 
the  time  histories  of  the  passively-observable  data  and  calcu¬ 
lates  the  ALN  outputs.  The  capabilities  required  by  the  ALN  are 
well  within  the  capabilities  of  current  microprocessors  using 
ROM-based  software  and  a  small  amount  of  RAM  for  temporary 
storage. 

Previously,  advanced  guidance  laws  have  not  been  generally  uti¬ 
lized  in  air-to-air  missiles  because  much  of  the  data  necessary 
for  their  implementation  has  been  denied  by  operational  missile 
sensor  systems.  Now,  through  the  use  of  ALN  techniques,  this 
missing  information  can  be  largely  recovered  in  an  efficient  and 
noise— insensitive  manner  from  the  time  histories  of  the  data 
available  from  the  sensors.  This  opens  up  a  new  world  of  possi¬ 
bilities  for  missile  guidance  applications. 
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ESTIMATING  TIME-TO-GO  FOR  USE  IN  ADVANCED  GUIDANCE  LAWS 


Tom  L.  RiggSf  Jr.,  Capt,  USAF 
Department  of  Astronautics  and  Computer  Science 
United  States  Air  Force  Academy,  Colorado  80840 


INTRODUCTION 

The  modern  air-to-air  missile  engagement  scenario  dictates  the 
need  for  high  performance  terminal  guidance  laws  that  are  capable 
of  steering  missiles  towards  successful  intercepts  against  highly 
maneuverable  and  intelligent  targets.  Numerous  studies  [1,2,3] 
over  the  past  few  years  have  shown  that  linear  optimal  control 
theory  can  yield  extremely  effective  guidance  algorithms  that  are 
capable  of  meeting  and  exceeding  this  demanding  mission  objec¬ 
tive.  However,  these  guidance  laws  require  more  information  than 
is  directly  available  from  existing  hardware,  such  as  relative 
range,  relative  velocity,  possibly  target  acceleration,  and 
time-to-go.  To  meet  the  information  needs  of  these  advanced  gui¬ 
dance  laws  research  into  developing  estimation  algorithms  has 
been  pursued  nearly  as  enthusiastically  as  in  the: guidance  area. 
Most  notably  is  the  work  in  estimation  performed  for  the  U.S.  Air 
Force  Armament  Laboratory  by  the  University  of  Texas. [4]  In  this 
effort,  it  was  shown  that  by  using  optimal  estimation  techniques 
important  state  information  including  the  relative  range  vector, 
relative  velocity  vector  and  target  acceleration  vector  can  be 
accurately  estimated  from  very  restricted  and  noisy  passive  (in¬ 
frared)  seeker  measurements  and  on-board  body -fixed  missile  ac¬ 
celeration  and  angular- rate  measurements.  Verges  and  Riggs  [5] 
showed  that  by  combining  those  state  estimation  techniques  with 
even  the  most  simple  linear  feedback  guidance  law,  drastic  mis¬ 
sile  performance  improvements  could  be  realized  over  conventional 
guidance  methods  (proportional  navigation)  if  and  only  if  those 
laws  were  mechanized  with  an  accurate  estimate  of  time-to-go.  As 
in  any  practical  problem,  the  desire  for  high  performance  is 
weighted  and  often  limited  by  the  issue  of  complexity.  This  is 
the  basis  of  the  research  on  time-to-go  estimation.  That  is, 
what  is  the  most  accurate  method  for  estimating  time-to-go  given 
the  restriction  that  it  must  be  implementable  in  a  microprocessor 
based  missile  guidance  computer? 

THEORETICAL  DEVELOPMENT 


Background 

The  need  for  an  accurate  measure  of  time-to-go  (the  amount  of 
time  remaining  to  intercept)  arises  from  the  theory  that  is  used 
in  deriving  the  guidance  laws.  The  parameter  time-to-go  appears 
naturally  in  the  solution  of  optimal  control  problems  where  time 


177 


is  the  independent  variable  and  final  time  is  assumed  to  be 
fixed.  Of  all  the  optimal  control  formulations  that  have  been 
developed  over  the  last  twenty  years,  the  one  that  most  often 
yields  practical  solutions  is  the  linear  regulator  formulation. 
The  missile  intercept  problem  can  be,  and  has  been,  mathematical¬ 
ly  described  such  that  it  lends  itself  to  a  special  case  of  the 
linear  regulator  formulation.  The  advantage  of  this  approach  is 
that  the  resulting  solution  is  a  set  of  algebraic  equations  in 
feedback  form.  Thus  the  solution  is  concise,  relatively  simple, 
and  to  some  degree  self-correcting.  The  drawback  to  this  ap¬ 
proach  is  the  need  for  complete  state  information  and  knowledge 
of  final  time.  As  noted  earlier,  the  state  information  problem 
has  been  addressed  with  successful  results.  Because  of  this,  for 
this  effort  it  will  be  assumed  that  complete  state  information  is 
available.  The  final  time  issue  is  the  subject  of  this  research. 

The  missile  intercept  problem  is  in  reality  a  free  final  time 
problem  within  certain  physical  constraints.  To  wit,  there  is  a 
set  of  final  times  at  which  the  missile  can  intercept  the  target 
within  the  lethal  range  of  the  warhead.  This  set  is  bounded  and 
determined  by  the  degree  of  controllability  of  the  missile  and 
the  chosen  trajectory  of  the  target.  Obviously  if  the  missile 
has  complete  control  of  its  acceleration  vector  both  in  magnitude 
and  direction,  the  set  of  final  times  becomes  the  positive  real 
number  set.  However,  the  missile  only  has  partial  and  finite 
control  of  its  acceleration  vector  through  the  use  of  lateral 
aerodynamic  control.  This  controllability  restriction  greatly 
limits  the  size  of  the  final  set.  Further,  this  set  shrinks  as 
the  engagement  proceeds,  ultimately  resulting  in  one  possible 
time,  the  time  at  which  the  point  of  closest  approach  is  reached. 
Realizing  these  physical  constraints,  one  might  ask  why  not 
derive  the  guidance  law  assuming  partial  acceleration  control  and 
allowing  final  time  to  be  free.  The  answer  is  simple.  If  the 
problem  is  formulated  in  that  manner  then  the  solution  is  not  in 
closed  form  nor  can  it  be  easily  implemented  on-board  the  mis¬ 
sile. 

These  practical  constraints  along  with  the  desire  to  obtain  a 
practical  good  performing  but  not  necessarily  optimal  guidance 
algorithm  makes  it  attractive  to  solve  the  problem  using  the  op¬ 
timal  linear  regulator  formulation.  Given  this  approach  the 
mechanization  of  the  resultant  guidance  law  requires  the  estima¬ 
tion  of  final  time  (and  subsequently  time-to-go)  in  order  to 
satisfy  the  original  fixed  final  time  assumption. 
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Guidance  Law 


The  guidance  law  that  was  used  for  this  study  is  given  by  equa¬ 
tion  (1) . 


*  3  T 

1 


(1) 


where 

=  (e^SO  +  _i)/(i^go)2 


I  =3x3  Identity  Matrix 

{ 

=  Commanded  Missile  Acceleration  Vector 
Sj^  =  Relative  (Target/Missile)  Range  Vector 
Vj^  =  Relative  (Target/Missile)  Velocity  Vector 
A^  =  Target  Acceleration  Vector 

The  complete  derivation  of  the  guidance  law  is  given  in  Section 
II  of  Reference  [5]  .  This  law  is  based  on  a  linear  kinematic 
engagement  model  and  a  linear  first-order  target  dynamic  model 
and  is  designed  to  minimize  final  range.  The  term  ir  in  the  gain 
is  a  constant  and  is  a  function  of  the  time  constant  of  the 
target  model.  All  of  the  vectors  are  made  up  of  three  orthogonal 
components  with  respect  to  some  arbitrary  cartesian  coordinate 
system.  The  predominance  of  time-to-go,  denoted  by  tgo  is  clear 
in  the  equations. 

Time-to-go  Assuming  Constant  Closing  Velocity 

The  most  commonly  used  method  for  estimating  time-to-go  is  given 
by  equation  (2) .  This  method  is  based  on  the  assumption  that  the 
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(2) 


acceleration  along  the  line-of-sight  is  zero  for  all  time, 
tgo  =  -R/R 

Estimating  Time-to-go  Numerically 

The  problem  of  estimating  time-to-go  involves  the  prediction  of 
the  line-of-sight  acceleration  for  all  future  time.  This  is 
indeed  a  formidable  task  since  this  acceleration  is  the  dot  pro¬ 
duct  of  the  target/missile  relative  acceleration  vector  (a  func¬ 
tion  of  time)  with  a  unit  vector  that  lies  along  the  line-of- 
sight  (also  a  function  of  time) .  Obviously  this  cannot  be  deter¬ 
mined  in  general  because  of  the  uncertainty  of  future^  target 
maneuvers.  However ^  the  acceleration  along  the  line-of-sight  can 
be  approximated  by  assuming  that  the  target  acceleration  is  zero 
and  the  missile^s  axial  acceleration  is  the  dominant  contributor 
to  the  line-of-sight  acceleration.  Once  this  assumption  is  made 
the  problem  reduces  to  the  construction  of  a  time  dependent  func¬ 
tion  that  represents  the  missile‘*s  axial  acceleration.  This  was 
done  with  a  very  simple  function  (given  by  •  equation  3)  in  [6] 
resulting  in  a  time-to-go  algorithm  that  not  only  was  accurate 
under  many  launch  conditions  but  greatly  increased  the  missile  s 
performance  over  conventional  time-to-go  mechanizations.  Unfor¬ 
tunately  those  results  did  have  limitations,  especially  on  the 
long  range  launches.  This  was  due  to  the  function  used  to  model 
the  missile "“s  axial  acceleration. 

A  .time  <  time  of  engine  burnout 
max’  (3) 

A  .  .time  >  time  of  engine  burnout 
mm  ’ 

where  A  and  A  .  are  constants  determined  from  missile 
thrust/drag  characteristics. 

An  improved  approach  was  suggested  by  York  [7]  in  that  actual 
missile  axial  acceleration  data  is  used  to  curve-fit  an  approxi¬ 
mate  function  of  exponential  form  during  thrust  off  and  of  linear 
form  during  thrusting.  This  is  the  basis  of  the  following  numer¬ 
ical  .  algor  ithm. 

Algorithm  Development 

Based  on  an  assumed  model  of  the  line  of  sight  acceleration,  the 
algorithm  is  constructed  to  determine  the  time  at  which  the  range 
will  be  minimized  given  the  present  range  and  range  rate.  To  do 
this,  the  following  assumptions  will  be  made. 
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Assume  non-maneuvering  target 


2.  Assume  acceleration  along  line-of-sight  can  be  modeled  as 


Alos  ■  ■ 


‘"‘bo 


note  -  t|jQ  is  time  of  burnout 


R(t)  =  y*R  dt 

R(t)  =yR  dt  =//«  dt 


Since  R(t)  is  a  piecewise  continuous  function  with  one  discon¬ 
tinuity  there  will  be  three  possible  solutions  for  R(t)  and  R(t) . 
Those  solutions  are 


Definitions 


t^  -  present  time 
t£  -  final  time 
Rq  “  present  range 
R  -  present  range  rate 


^  i  -  to<tbo  »  tf<tbo 
R(t)=k(t-tQ)  +  R^ 


R(t)  =  |(t-t^2+  Ro(t-to)  +  Rq 


R(t)  = 


>  tu 

bo 

’  ^f  ^  ^bo 

a  ^bt 

a  ^bto  . 

b  ® 

-  E  ®  + 

h  h 


b 

787 


(4) 

(5) 


Case  3 


-  t 


This  case  is  a  combination  of  Case  1  and  2. 


Use  eqns  for  Case  1  and  solve  for 

substitute  those  quantities  into  the  equations  for  Case  2. 


Therefore 

R(t) 


a  a  J^^bo  .  p/x.  V 

b  ®  ■  b  ®  ^  ^'^bo^ 


(6) 


R(t) 


b  t 


bt.  _ 


(7) 


where 


R(t 

R(t 


bo) 


=  l"W=o)  ■"  "o 
=  4(tho-to)^  +  ^(‘bo-‘o'  + 


bo)  ■  2  "-bo  '-o 


(8) 

(9) 


Mechanization  of  the  Algorithm 

The  mechanization  of  the  algorithm  requires  logic  to  first  deter¬ 
mine  in  which  case  (1,  2,  or  3)  the  missile  is  operating.  This 
will  be  a  function  of  present  time,  range,  range-rate,  and  the 
near  future  acceleration  profile.  Once  the  case  is  determined, 
the  appropriate  equations  are  solved  by  the  following  methods: 

1.  For  Case  1  time-to-go  can  be  calculated  in  closed  form 
by 


tgo 


-V  v/(Ro)2  +  2kRo 


CIO) 
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2.  For  Case  2.  A  Newton  iteration  technique  is  used  to 
determine  the  time  at  which  R(t)  =  0  in  equation  (5).* 

3.  For  Case  3.  A  Newton  iteration  technique  is  used  to 
determine  the  time  at  which  R(t)  =  0  in  equation  (7)*  using  the 
values  of  R(tjjQ)  and  determined  by  equations  (8)  and  (9). 

*NOTE:  Prior  to  solving  equation  (5)  or  (7)  ,  a  check  must  be 
made  to  determine  if  R(t)  =  0  is  a  valid  solution.  This  is  done 
by  determining  the  time  at  which  the  range  rate  will  go  to  zero. 
To  do  this  equation  (4)  or  (6),  as  appropriate,  is  set  to  zero 
and  solved  for  t.  That  t  is  then  substituted  into  equation  (5) 
or  (7)  ,  appropriately,  and  R(t)  is  calculated.  If  R(t)  is  nega¬ 
tive  the  algorithm  proceeds.  If  not,  then  the  time  at  which 
R(t)  =  0  is  used  as  the  final  time. 

This  process,  although  involved,  is  solvable  with  a  digital  com¬ 
puter.  The  algorithm  is  intended  to  be  solved  every  At  as  the 
missile  flys  toward  the  target.  If  sample  time  causes  throughput 
bottlenecks  the  appropriate  modifications  could  be  made,  such  as 
solving  for  new  t^^’s,  every  two  or  three  guidance  computations, 
as  required. 

Closed-Form  Method 

Recall  that  the  reason  for  needing  time-to-go  in  the  guidance  law 
stems  from  the  original  assumptions  made  in  deriving  the  law; 
that  is,  that  final  time  is  fixed  and  the  missile  has  complete 
control  of  its  acceleration  vector.  The  following  time-to-go 
algorithm  attempts  to  rectify  these  deviate  assumptions. 

Consider  a  rewritten  form  of  the  guidance  law  given  in  equation 
(1) ,  and  referenced  to  the  missile  body  coordinate  frame. 


^  ^  ^T^TX^  (11a) 

Aj^Y  =  3  +  K^A^y^ 

hiZ  =  3  +  KjAj,)  (lie) 


Given  knowledge  of  the  vector  quantities  S„,  Vj^,  and  A  (recall 
this  information  will  be  supplied  by  the  state  estimator)  then 
the  unknowns  in  equation  (11)  are  tgo.  The 
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quantity  is  the  missile^s  axial  acceleration  command.  This 

quantity  is  not  controllable  and  would  normally  not  be  of 
interest.  However,  if  the  commanded  axial  acceleration  were 
forced  to  be  the  actual  measured  axial  acceleration  then  there 
would  be  only  one  unknown  in  equation  11a,  tgo.  This  is  the 
basis  for  this  time-to-go  algorithm.  That  is,  pick  a  tgo  that 
forces  the  commanded  axial  acceleration  to  equal  the  present  mis¬ 
sile  axial  acceleration  measurement.  The  value  of  tgo  is  then 
used  to  solve  equations  lib  and  lie  for  A™  and  Aj.„.  With  this 
logic  and  the  use  of  the  Quadratic  Pormula^the  following  closed 
form  solution  for  time-to-go  results. 


tgo 


2S 


RX 


where 


(12) 


"  ^X  ^^T^' 


TX 


t^-At 

o 


Mechanization  of  the  Algorithm 


To  mechanize  the  algorithm  all  the  vector  components  must  be 
known  with  respect  to  the  missile  body  coordinate  system.  The 
terms  S-^,  Sj^y,  and  A^^  are  obtained  from  the  state  estimator  and 
A^  is  tne  measured  axial  acceleration  from  an  onboard  accelerom- 
e^r.  The  term  K”  is  an  approximation  of  the  proper  value  of  K„. 
Recall  that  is  a  function  of  time-to-go.  This  creates  a  prob¬ 
lem  in  mechanizing  equation  (12)  directly  since  we  are  solving 
for  time-to-go.  Fortunately,  K™  varies  slowly  as  a  function  of 
time-to-go.  Using  this  fact,  equation  (12)  can  be  solved  using  a 
value  of  calculated  from  the  most  recent  past  estimate  of 
time-to-go,  hence  the  notation  K^. 

Exact  Non-realizable  Method 


In  order  to  evaluate  any  method  for  estimating  time-to-go  an 
exact  truth  model  must  be  established.  This  is  more  complicated 
than  flying  the  missile  and  post  priori  determining  final  time. 
The  reason  for  this  is  simple.  The  time-to-go  estimate  used  at 
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each  guidance  conunand  interval  will  cause  a  different  accelera¬ 
tion  command,  hence  a  different  trajectory  and  hence  a  different 
final  time.  To  determine  the  exact  trajectory  the  missile  will 
fly  with  the  exact  knowledge  of  time-to-go  requires  an  impracti¬ 
cal  iterative  method  comprised  of  multiple  flyouts.  Although 
such  a  method  is  non-realizable  for  real  world  applications,  it 
is  useful  as  an  evaluation  tool.  Such  a  method  was  developed  to 
determine  the  missile^’s  performance  using  exact  knowledge,  of 
time-to-go.  The  results  are  surprising  as  will  be  seen  later. 

The  algorithm  is  a  straight  forward  recursive  iteration  scheme 
often  used  to  solve  transcendental  equations  and  is  depicted  in 
Figure  1.  It  has  one  check  in  it  to  insure  stability  and  is 
exited  vgien  the  change  in  two  subsequent  flight  times  is  less 
than  lO”  seconds. 
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ANALYSIS 


Approach 

The  most  economical  and  effective  method  to  evaluate  any  time- 
to-go  algorithm  is  to  implement  it  in  a  detailed  simulation  of  a 
missile  system  and  perform  simulated  missile  fly-outs  against 
realistic  target  maneuvers.  This  was  accomplished  in  the  ana¬ 
lyses  of  the  techniques  presented  in  this  paper. 

The  simulation  consists  of  a  six-degree-of-freedom  (6-DOF)  mis¬ 
sile  model  of  a  conceptual  high  performance  bank-to-turn  short 
range  air-to-air  missile.  The  simulation  contains  detailed  non¬ 
linear  math  models  of  the  major  missile  subsystems  including  the 
seeker,  autopilot,  and  propulsion;  detailed  aerodynamic  models  of 
the  missile  airframe  characteristics  supported  by  wind  tunnel 
generated  aero  data;  and  the  models  that  describe  the  missile^s 
equations  of  motion.  Additionally,  the  simulation  contains  a 
three-degree-of-freedom  target  model  which  incorporates  a  nine 
"g"  out-of-plane  evasive  maneuver  algorithm. 

To  perform  the  analysis  the  guidance  law  given  by  equation  (1) 
was  implemented  with  each  time-to-go  algorithm  into  the  simula¬ 
tion  and  provided  all  required  information  assuming  zero  errors. 
Although  this  method  of  evaluation  deviates  from  the  "real" 
world,  it  does  provide  a  common  method  for  at  least  comparing  one 
time-to-go  technique  to  another.  To  be  sure,  if  a  particular 
technique  does  not  perform  well  under  these  ideal  conditions,  it 
certainly  won't  perform  well  under  more  realistic  and  restrictive 
conditions.  Therefore,  this  analysis  provides  an  intermediate 
step  in  determining  the  comparative  performance  of  the  candidate 
algorithms. 


Comparison  of  Algorithm's  Accuracy 


For  reference  purposes  the  four  algorithms  presented  in  this  pa¬ 
per  will  be  indexed  by  the  following  shorthand  names; 


TGOl 

TG02 

TG03 

TG04 


-R/R  (Equation  2) 

Numerical  Method  (Equations  4-10) 
Closed-form  Method  (Equation  12) 
Recursive  Method  (Figure  1) 


Figure  2  is  a  plot  of  the  time-to-go  estimates  versus  time  for 
the  four  algorithms.  As  can  be  seen,  TG02  and  TG03  and  of  course 
TG04  (recall  it  is  forced  to  be  accurate)  accurately  estimate 
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time-to-go  whereas  TGOl  has  considerable  error.  However,  all 
techniques  resulted  in  a  small  final  miss  distance  (less  than  2 
ft)  in  this  rather  mundane  engagement. 

Figure  3  shows  the  time-to-go  estimate  versus  time  from  TGOl  dur¬ 
ing  a  difficult  forward  hemisphere  of f-boresight  shot.  Note  that 
the  algorithm  over  estimates  time-to-go  early  in  the  flight  dur¬ 
ing  thrusting  and  then  underestimates  time-to-go  for  the  remain¬ 
ing  flight  (coasting  phase  begins  at  2.6  seconds).  This  happens 
in  all  cases  for  TGOl  due  to  the  fact  that  the  algorithm  assumes 
constant  closing  velocity  although  the  missile  sustains  high  ac¬ 
celeration  levels  due  to  thrust  and  drag.  For  this  case,  the 
missile  attained  a  final  miss  distance  of  15.6  feet. 

Figure  4  plots  the  time-to-go  estimates  for  TG02,  TG03,  and  TG04 
made  during  the  same  initial  engagement  as  in  Figure  3.  Each 
time-to-go  algorithm  caused  a  different  final  time,  hence  the 
three  different  asymtotes.  In  this  case,  TG03  performs  the  worst 
in  terms  of  estimation  accuracy,  however;  unexpectedly,  the  most 
accurate  time-to-go  method,  TG04,  results  in  an  extremely  poor 
miss  distance.  Other  similar  shots  were  tried  to  determine  if 
this  was  an  isolated  case  or  a  general  problem.  It  was  found 
that  the  same  problem  occurs  in  many  other  large  off-boresight 
forward  hemisphere  launches.  To  help  determine  the  reason  for 
this  deviation  from  expected  performance  consider  the  differences 
in  TG03  and  TG04.  TG03  underestimated  time-to-go  during  the 
thrusting  phase  and  then  tracked  true  time-to-go  nearly  exactly 
from  burnout  to  intercept.  The  only  significant  difference  in 
TG03's  accuracy  as  compared  to  TG04  was  during  the  thrusting 
phase.  Now  consider  how  this  affected  the  missile^s  acceleration 
commands  over  the  subsequent  flights.  Figure  5  plots  the  com¬ 
manded  normal  acceleration  profiles  associated  with  TG03  and 
TG04.  First  consider  the  profile  due  to  TG03.  It  is  clear  that 
the  missile  commanded  an  extremely  hard  initial  turn  and  then 
from  about  two  to  five  seconds  it  commanded  a  low  g  level  turn. 
The  peak  that  occurs  at  5.1  seconds  is  in  reaction  to  a  last 
ditch  target  maneuver  which  is  activated  at  approximately  1 
second  before  intercept.  Intercept  occurs  at  6.1  seconds.  Now 
consider  the  profile  associated  with  TG04.  For  the  first  five 
seconds  it  is  nearly  a  constant  26  g  turn.  At  5.7  seconds  the 
missile  reacts  to  the  target  maneuver.  During  the  last  half 
second  the  steering  errors  are  large  and  the  missile  is  command¬ 
ing  large  acceleration.  Unfortunately  the  missile  dynamics  can^’t 
process  these  commands  instantaneously  resulting  in  a  large  miss. 
It  appears  that  the  major  difference  in  the  commanded  accelera¬ 
tion  profiles  is  the  hard  initial  turn.  This  turn  occurs  during 
the  same  time  frame  that  TG03  is  underestimating  time-to-go. 
Since  this  occurs  early  in  the  launch,  the  geometry  of  the  en¬ 
gagement  is  virtually  the  same  indicating  that  the  different  ac- 
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celeration  profiles  are  directly  attributable  to  the  time-to-go 
algorithm. 

To  help  verify  this  claim  the  mechanization  of  TG04  was  modified 
such  that  a  value  of  .75  of  the  true  time-to~go  was  used  for  gui¬ 
dance  during  the  thrusting  phase.  After  burnout,  the  true  time- 
to-go  was  used  as  in  the  original  TG04.  The  missile  was  then  re 
flown  with  the  modified  TG04  resulting  in  a  miss  distance  of  .49 
feet.  More  importantly  the  acceleration  profile  for  the  modified 
algorithm  was  examined  to  see  if  it  exhibited  the  same  charac¬ 
teristics  as  TG03.  Not  surprisingly  it  did.  In  fact,  the  pro¬ 
files  were  nearly  the  same.  Figure  6  shows  the  commanded  ac¬ 
celeration  profile  due  to  the  modified  TG04.  This  is  a  very  sig¬ 
nificant  result.  It  suggests  that  in  terms  of  missile  perfor¬ 
mance  one  does  not  always  want  to  predict  time-to-go  accurately 
for  mechanizing  linear  feedback  laws.  However,  this  raises  the 
Question  how  and  when  should  the  time-to-go  estimate  be  inaccu 
rate?  To  help  answer  this  question  further  analyses  were  per¬ 
formed. 


Aspect 

Angle 

O-B 

Angle 

Range 

(FT) 

TG03 

TG04 

MOD  TG04  1 

*.75 

*.50 

45 

0 

4000 

1.85 

.61 

.81 

30.8 

45 

40 

.35 

.31 

.52 

.55 

90 

0 

6000 

2.37 

.69 

.87 

75.1 

90 

40 

6000 

.51 

.51 

.63 

.63 

90 

0 

10000 

.48 

.32 

■a 

.41 

90 

40 

10000 

.58 

333. 

.73 

.47 

mmm 

0 

15000 

.83 

.59 

.51 

.78 

40 

15000 

1.38 

85.2 

.49 

.17 

TABLE  1.  Comparison  of  Miss  Distances 
for  TG03,  TG04,  &  Modified  TG04 
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Experience  with  TGOl  clearly  establishes  that  time-to-go  should 
not  be  overestimated.  Doing  so  makes  the  missile  wait  until  the 
steering  errors  become  excessively  large,  forcing  drastic  end¬ 
game  maneuver  requirements,  possibly  exceeding  the  missiles  capa¬ 
bility.  This  means  that  if  there  is  to  be  error  in  the  time-to-go 
parameter  then  it  should  be  such  that  the  estimate  is  less  than 
the  true  value.  To  help  determine  when  and  by  how  much  time-to- 
go  should  be  underestimated,  the  modified  TG04  was  used  with 
values  of  .75  and  .5  true  time-to-go  during  thrusting  to  evaluate 
the  missile^'s  performance  in  various  engagements.  The  results  of 
this  study  are  given  in  Table  1.  The  results  show  that  knowledge 
of  true  time-to-go  can  result  in  large  miss  distances.  However, 
this  only  happens  in  initial  large  of f-boresight  launches  and  not 
when  the  missile  is  launched  on  a  near  intercept  trajectory. 
Further,  the  results  show  that  underestimating  time-to-go  by  a 
factor  of  .5  during  thrusting  can  cause  the  missile^s  performance 
for  on-boresight  launches  to  significantly  decrease.  The  combi¬ 
nation  of  these  results  indicates  that  time-to-go  should  only  be 
underestimated  in  large  off-boresight  launches  in  order  to  null 
out  the  large  steering  errors  as  soon  as  possible.  Doing  this 
forces  the  missile  to  attain  a  collision  course  early  in  flight 
minimizing  the  chances  of  late  evasion  due  to  drastic  target 
maneuvers.  A  further  benefit  of  the  resulting  trajectory  is  that 
the  missile  performs  its  hard  turning  at  lower  velocities  prior 
to  realizing  the  total  effects  of  thrusting,  thus  minimizing  in¬ 
duced  drag  effects  and  increasing  the  missile'^s  energy.  This 
will  naturally  result  in  extended  range  capabilities. 

Compensating  For  Large  Of f-Boresight  Errors 

To  be  sure, there  are  a  number  of  ad  hoc  approaches  to  compensat¬ 
ing  for  large  off-boresight  errors.  For  instance,  a  direct  ap¬ 
proach  could  be  to  add  a  bias  term  to  the  guidance  law  that  would 
be  directly  proportional  to  off-boresight  angle.  This  is  cer¬ 
tainly  a  feasible  but  possibly  very  Involved  approach.  A  more 
subtle  method  involves  mechanizing  the  time-to-go  algorithm  in 
missile  body  coordinates.  Recall  that  TG03  does  this.  To  under¬ 
stand  the  rationale  behind  this  approach  consider  Figure  7.  Fig¬ 
ure  7  depicts  the  relative  position  geometry  for  a  planar  engage¬ 
ment.  The  line-of-sight  vector  has  a  magnitude  equal  to  the  re¬ 
lative  range  and  lies  along  the  target/missile  sight  line.  The 
vectors  R  and  R  are  orthogonal  vectors  aligned  in  the  missile^s 
axial  and^  normal^  directions  respectively.  The  missile  guidance 
problem  is  simply  to  drive  and  maintain  the  magnitude  of  the  vec¬ 
tor  R  to  zero  prior  to  the  time  that  the  magnitude  of  the  vector 
R  go^s  to  zero.  Ideally,  at  final  time  R  and  R  will  both  go 
tS  zero  simultaneously.  The  optimal  guidance  law  attempts  to  do 
this.  Most  approaches  to  estimating  time-to-go  are  concerned  with 
determining  the  time  at  which  the  magnitude  of  the  line-of-sight 
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vector  will  be  minimized  but  not  necessarily  zero.  This  is  an 
important  yet  subtle  point.  The  only  scenario  in  which  the 
line-of-sight  vector  goes  to  zero  is  in  the  special  case  of  a 
direct  hit.  However,  for  any  given  control  logic  there  will  be  a 
time  at  which  the  line-of-sight  is  minimized  hence  a  valid  final 
time  for  use  in  calculating^  time-to-go.  Now  consider  the 
scena^rios  in  which  the  vector  R  goes  to  zero.  To  wit,  the  vec¬ 
tor  R  goes  to  zero  in  every  case  that  the  missile  can  overtake 
the  target.  In  other  words,  if  the  missile  has  a  velocity  advan¬ 
tage  over  the  target,  the  magnitude  of  R  will  go  to  zero.  This 
includes  all  cases  in  which  an  intercept  can  occur.  By  calculat¬ 
ing  the  time  at  which  this  vector  goes  to  zero,  that  time  can  na¬ 
turally  be  used  by  the  guidance  to  command  normal  accelerations 
such  that  the  magnitude  of  R  will  go  to  zero  before  or  at  that 
final  time.  ^ 

The  way  in  which  the  use  of  the  body  referenced  range  vectors  for 
time-to-go  calculations  relates  to  off-boresight  angle  is  simple. 
The  ratio  of  R  and  R  is  a  direct  function  of  the  off-boresight 
angle.  The  larger  the  angle,  the  smaller  the  magnitude  of  R  and 
hence  for  a  given  thrust  profile  the  smaller  the  value  for  time- 
to-go.  Since  the  guidance  gains  are  inversely  proportional  to 
the  value  of  time-to-go,  larger  normal  accelerations  will  be  com¬ 
manded  to  drive  the  magnitude  of  R^  to  zero. 

A  further  benefit  of  calculating  time-to-go  using  missile  body 
referenced  information  is  that  the  acceleration  in  the  missile 
axial  direction  is  well  defined  reducing  to  sources  of  error  to 
essentially  the  uncertainty  in  target  acceleration.  This  is  in 
contrast  to  the  uncertainty  in  line-of-sight  acceleration  which 
is  a  function  of  the  missile^s  axial  acceleration,  normal  ac¬ 
celeration  and  target  acceleration. 

Missile  Performance 

The  ultimate  performance  criteria  for  any  guidance  technique  is 
its  ability  to  hit  the  target  under  all  possible  initial  engage¬ 
ment  conditions.  One  commonly  used  method  for  measuring  how  well 
a  missile  meets  the  performance  criteria  is  through  a  determina¬ 
tion  of  the  missile^’s  inner  and  outer  launch  boundaries.  This 
was  done  to  evaluate  the  time-to-go  algorithms"  effect  on  the 
missile^s  performance.  Each  of  the  three  realizeable  time-to-go 
algorithms  (TGOl,  TG02,  and  TG03)  was  interfaced  with  the  gui¬ 
dance  law  and  mechanized  in  the  6-DOF  simulation.  Using  a  binary 
search  algorithm  the  inner  and  outer  launch  boundaries  were  com¬ 
puted  for  each  time-to-go/guidance  configuration.  (This  was  not 
accomplished  for  TG04  because  of  the  extremely  large  number  of 
runs  that  would  be  required) . 
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The  inner  launch  boundary  defines  the  minimum  range  from  which 
the  missile  can  be  launched  and  achieve  a  hit.  (A  hit  is  scored 
any  time  the  point  of  closest  approach  is  within  ten  feet  of  the 
target) .  The  outer  launch  boundary  defines  the  maximum  range 
from  which  the  missile  can  be  launched  and  achieve  a  hit.  Obvi¬ 
ously,  there  is  an  infinite  number  of  launch  conditions  that  can 
be  selected  for  evaluation.  In  order  to  limit  the  evaluation 
process  a  set  of  initial  launch  conditions  is  selected.  The 
selgction  of  these  conditions  should  be  done  such  ^  that  the 
evaluation  will  provide  a  good  sampling  of  the  weapon  s  perfor¬ 
mance  over  all  expected  initial  conditions.  To  this  end  the  fol¬ 
lowing  conditions  were  selected. 

1.  Missile  and  target  are  co-speed  and  flying  straight 
and  level  at  launch.  (.9  Mach). 

2.  Missile  and  target  are  co-altitude  at  launch 
(10,000  ft) . 

3.  The  initial  aspect  angle  (angle  between  the 
target'*s  velocity  vector  and  the  line-of-sight)  is 
varied  from  zero  to  180  degrees  in  30  degree  incre¬ 
ments.  Zero  degrees  aspect  angle  is  a  tail-on  shot 
whereas  180  degrees  is  a  head-on  shot. 

4.  The  initial  off-boresight  angle  (angle  between  the 
missile^'s  velocity  vector  and  the  line-of-sight)  is 
either  zero  degrees  or  40  degrees  lagging.^  Zero 
degrees  off-boresight  angle  means  the  missile  is 
launched  directly  at  the  target.  Forty  degrees  lagging 
off-boresight  angle  means  the  missile  is  fired  such 
that  the  missile^s  velocity  vector  is  pointed  40 
degrees  behind  the  target. 

The  first  two  conditions  (speed  and  altitude)  were  selected  to  be 
representative  of  dogfight  conditions.  The  aspect  angle  ^can  in 
reality  vary  from  zero  to  360  degrees,  however,  it  was  limited  at 
180  degrees  because  these  angles  represent  a  worst  case  due  to 
the  nature  of  the  target  maneuver.  The  two  off-boresight  angles 
were  selected  to  evaluate  the  missile  under  a  favorable  off- 
boresight  (zero  degrees)  and  at  an  extremely  difficult  off- 
boresight  condition  (40  degrees  lagging) .  The  initial  engagement 
geometry  is  depicted  in  Figure  8. 

Table  2  gives  the  inner  launch  boundaries  for  the  three  algo¬ 
rithms  while  Table  3  gives  the  outer  launch  boundaries.  The 
inner  boundaries  are  accurate  to  within  125  feet  whereas  the 
outer  boundaries  have  a  250  foot  accuracy.^  Note  that  for  the 
forty  degree  off-boresight  launches  the  missile  failed  to  hit  the 


194 


0° 

30° 

60° 

90° 

120° 

O 

o 

in 

180° 

1 

TGOl 

1000 

1000 

1375 

1675 

3125 

3125 

1875 

TG02 

1000 

1000 

1000 

1125 

2625 

2625 

1875 

TG03 

1000 

1000 

1000 

1000 

2625 

2875 

1875 

40° 

0 

B 

A 

TGOl 

* 

* 

4r 

★ 

3625 

4375 

4125 

TG02 

1000 

1375 

1875 

2125 

2325 

3625 

3875 

TG03 

1000 

1 

1000 

1000 

1375 

2125 

2875 

3875 

♦Designates  a  No-Hit  Condition.  The  Missile  Failed  to  Intercept 
From  Any  Launch  Range. 


TABLE  2.  INNER  LAUNCH  BOUNDARIES  IN  FEET  FOR 
T601,  TG02,  and  TG03 
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TABLE  3.  OUTER  LAUNCH  BOUNDARIES  IN  FEET  FOR 
TGOl,  TG02,  and  TG03 
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TABLE  5.  OUTER  LAUNCH  BOUNDARIES  IN  FEET  FOR  TG02,  TG03,  & 
TG02  COMPUTED  IN  BODY  COORDINATES 
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target  from  any  launch  range  when  the  guidance  employed  TGOl. 
This  was  due  to  TG01'*s  overestimation  of  time-to-go  which  caused 
the  missile  to  wait  too  long  before  adequate  steering,  thus  fly¬ 
ing  itself  into  a  non-recoverable  situation.  TG02  performed  much 
better  than  TGOl,  however,  its  performance  fell  far  short  of  that 
obtained  by  the  closed  form  algorithm,  TG03,  especially  in  the 
large  off-boresight  scenarios.  This  is  due  to  a  violation  in  the 
original  assumptions  used  in  deriving  TG02,  namely  the  assumption 
that  the  missile^s  axial  acceleration  is  closely  aligned  to  the 
line-of-sight  vector.  Based  on  this,  it  is  reasonable  to 
hypothesize  that  TG02'’s  performance  would  increase  if  it  was 
mechanized  with  missile  body  reference  information  as  is  done 
with  TG03.  To  see  if  this  was  true,  TG02  was  remechanized  using 
body  reference  information  and  re-evaluated.  Tables  4  and  5  give 
the  results  from  this  study.  As  can  be  seen,  the  performance  of 
the  missile  greatly  increased.  The  body  mechanized  TG02  obtained 
the  same  inner  boundaries  as  TG03  but  the  outer  boundaries, 
although  considerably  better  than  TG02,  fall  short  of  the  perfor¬ 
mance  obtained  by  TG03.  This  is  probably  due  to  the  fact  that 
TG02  assumes  zero  target  acceleration  whereas  TG03  does  not  make 
that  assumption.  Other  studies  have  shown  that  target  accelera¬ 
tion  information  improves  outer  launch  boundary  performance..  [5] 

Complexity  of  Solution 

One  of  the  fundamental  issues  involved  in  the  selection  of  any 
time-to-go  algorithm  is  the  issue  of  complexity.  Since  the  algo¬ 
rithm  will  be  implemented  in  a  small  dedicated  guidance  computer 
it  is  mandatory  that  the  algorithm  be  as  simple  as  possible  while 
maintaining  a  high  level  of  expected  missile  performance. 

Table  6  summarizes  the  complexity  requirements  versus  the  com¬ 
parative  performance  results  based  on  the  launch  boundary  deter¬ 
minations  for  TGOl,  TG02,  and  TG03.  The  table  gives  the  number 
of  Fortran  lines  of  code  used  to  mechanize  the  algorithms  in  the 
simulation  and  the  special  functions  needed  to  solve  the  algo¬ 
rithms.  As  can  be  seen  TG03  performed  the  best  and  was  relative¬ 
ly  simple  to  mechanize. 


ALG 

#Lines 
of  Code 

Special 

Functions 

Relative 

Performance 

TGOl 

2 

0 

3 

TG02 

48 

Square  Root 
Exponential 

2 

TG03 

3 

Square  Root 
Exponential 

1 

TABLE  6.  SUMMARY  OF  COMPLEXITY  REQUIREMENTS  AND  RELATIVE  PERFORMANCE 
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SUMMARY  AND  CONCLUSIONS 


Four  time-to-go  algorithms  were  developed  and  evaluated  to  deter¬ 
mine  both  the  accuracy  of  the  algorithms  and  the  effect  of  the 
algorithms  on  missile  performance.  The  following  results  were 
found. 


1.  The  commonly  used  method  for  estimating  time-to-go, 
range  over  range-rate,  severely  limits  the  missile'*s 
performance  capability. 


2.  If  the  guidance  law  is  subopt imal  as  is  the  one 
used  in  this  study,  use  of  perfect  knowledge  of  tirae- 
to-go  can  cause  substandard  missile  performance  in  high 
dynamic  engagements.  In  these  cases,  underestimation 
of  time-to-go  during  the  thrust  phase  will  compensate 
for  the  suboptimality  of  this  guidance  law  by  nulling 
large  initial  steering  errors  early  in  the  engagement. 


3.  Time-to-go  calculations  should  be  accomplished  in 
missile  body  reference  coordinates. 


4.  Iterative  techniques  for  estimating  time-to-go  are 
very  sensitive  to  modeling  errors  making  it  imperative 
that  the  algorithms  contain  complex  equations.  These 
methods,  although  solvable,  tend  to  be  very  compli¬ 
cated. 


5.  The  best  performing  yet  simplistic  algorithm  is  the 
closed-form  algorithm  that  forces  the  commanded  missile 
axial  acceleration  to  be  equal  to  the  actual  missile 
axial  acceleration. 
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'  ABSTRACT 

The  design  and  implementation  of  microprocessor-based  discrete-time  optimal 
controllers  for  the  XM-97  helicopter  gun-turret  control  system  is  considered. 
Nonfiring  and  Firing  test  results  are  presented.  Further  testings  and  the 
design  and  implementaion  of  disturbance  cancellating  turret  controllers  are 
currently  underway. 

‘  1 .  INTRODUCTION 

The  design  of  a  high  precision  helicopter  pointing  control  system  using  modern 
control  and  observer  theory  has  been  investigated  by  Coleman,  Loh,  et  al  in 
ri].  The  pointing  control  system  investigated  was  the  XM-97  helicopter 
guri-turret  control  system.  The  continuous-time  optimal  controllers  and 
observers  developed  in  [1]  v:ere  implemented  by  using  standard  analog 
electronic  components.  Thfe  resulting  performance  improvement  of  the  optimal 
turret  control  system,  in  terms  of  round  dispersion,  turret  overshoot,  turret 
settling  time,  etc.,  ranged  from  a  factor  of  2  to  1  to  a  factor  10  to  1  when 
compared  with  the  performance  of  the  original  system. 

Recently,  the  continuing  design  efforts  on  helicopter  turret  control  systems 
have  been  directed  towards  the  development  of  high  precision  turrets  employing 
microprocessor-based  optimal  controllers.  Results  presented  in  [2] 
demonstrated  that  it  is  indeed  feasible  to  implement  such  an  optimal 
controller  in  a  real  time  environment  by  using  state  of  the  art 
microprocessors.  Furthermore,  numerical  processor  chips  such  as  the  8087  are 
sufficiently  fast  enough  to  permit  implementing  some  forms  of  adaptive  control 
laws  which  may  further  enhance  control  system  performance. 

The  purpose  of  this  paper  is  to  present  some  preliminary  results  on  the 
performance  of  the  XM-97  helicopter  turret  investigated  in  [1]  employing 
microprocessor-based  optimal  controllers.  The  microprocessor  sofeware  was 
developed  by  using  a  basic  Intel  220  development  system  expanded  to  64K  bytes 
RAl*!  and  1.25  megabytes  disc. 

The  organization  of  the  paper  is  as  follows.  Section  2  gives  a  brief  descrip¬ 
tion  of  thJe  existing  XM-97  helicopter  gun-turret  control  system  and  its  step 
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responses.  Section  3  presents  a  simplified  mathematical  model  of  the  original 
system.  The  formulation  and  design  of  discrete-time  optimal  controllers  for 
both  the  azimuth  and  elevation  channels  of  the  gun-turret  based  on  the 
simplied  model  developed  in  Section  3  is  presented  in  Section  4.  The 
microprocessor-based  implementation  of  discrete-time  optimal  turret 
controllers  is  discussed  in  Section  5.  Section  6  presents  preliminary 
nonfiring  and  firing  test  results.  Further  performance  evaluations  of  the 
microprocessor-based  XM-97  helicopter  gun-turret  control  system  and  the  design 
of  microprocessor-based  disturbance  cancellating  controllers  are  currently 
underway;  the  results  will  be  reported  elsewhere. 


2.  DESCRIPTION  OF  THE  EXISTING  XM-97  HELICOPTER 
GUN-TURRET  CONTROL  SYSTEM 


The  XM-97  helicopter  gun-turret  system  consists  of  a  three-barrel  20mm 
automatic  cannon  system  and  a  turret  drive  system  mounted  under  the  nose 
section  of  a  Cobra  helicopter.  For  test  purposes,  the  gun-turret  and 
helicopter  airframe  are  suspended  from  a  six-degree-of-freedom  simulator  as 
shown  in  Fig.  1.  The  gun-turret  control  system  is  essentially  an  inertial 
load  driven  by  a  pulse  width  modulated  split  series  DC  motor  through  a 
compliant  gear  box.  The  transfer  functions  of  the  system  are  as  shown  in  Fig. 
2.  The  system  consists  of  two  controllers:  one  controller  positions  the  gun 
turret  in  azimuth  and  the  other  elevates  and  depresses  the  gun  cradle  and  the 
gun.  The  two  controllers  are  functionally  similar  and  independent.  As  shown 
in  Fig.  2,  the  only  difference  between  the  two  controllers  is  the  gear  ratio  N 
which  is  N  =  620  for  the  azimuth  channel  and  N  =  810  for  the  elevation  channel. 


TTie  existing  gun-turret  control  system  essentially  employs  angular  position 
feedback  and  angular  velocity  feedback.  With  the  state  variables  chosen  as 
shown  in  Fig.  2,  the  turret  dynamics  is  described  by  the  following 
8— d imens ional  vector  differential  equation  (for  both  azimuth  and  elevation 
channels), 

x(t)  =  Ax(t)  +  Bu(t),  x(0)  =  Xo» 


where 

x(t) 

x'l(t) 

X4(t) 

x’2(t) 

X3(t) 

X5(t) 

X6(t) 

X7(t) 


[x'i(t)  x'2(t)  X3(t)  X4(t)  X5(t)  X6(t)  X7(t)  xgCt)]'^, 

gun  turret  angular  position  relative  to  the  hull  (radians), 

gun-turret  angular  velocity  relative  to  the  hull  (radians/second), 

motor  angular  velocity  relative  to  the  hull  (radians/second), 

motor  torque  (foot-pounds), 

power  amplifier  output  (volts), 

low  level  electronics  output  (volts), 

geared-doT^n  shaft  angular  position  relative  to  the  hull  (radians). 
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Fig.  1  XM-97  HELICOPTER  GUN-TURRET  SUSPENDED  FROM  SIMULATOR 
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Fig.  2  ORIGINAL  XM-97  HELICOPTER  TURRET  CONTROL  SYSTEMS  -  AZIMUTH  AND  ELEVATION  CHANNELS 


xgCt)  =  output  of  tachometer  feedback  loop  (volts), 

Xj-(t)  =  gunner  command  input  (radians), 
u(t)  =  control  input  (volts)  =  “  x*3^(t) 

and  A  and  B  are,  respectively,  8x8  and  8x1  constant  matrices  given  by 


A  * 


0 

2/NxlO^® 

0 

-3.185x10® 

0 

-9.095x10^ 

0 

2/Nx10^°P2 


0 

0 

-9.6p^ 

0 

0 

0 

1/N 

0 


0  1  0 

0  4x10®  0 

-5x10^  0  BxlO^Pj 

0  -3.185x10®  0 

0  0  -5x10^ 

0  0  0 

0  0  0 

4x10®P2  0  0 


0 

0 

0 

0 

-2/Nxl0^° 

0 

0 

0 

0 

0 

3.185x10® 

0 

3.750x10® 

0 

-3.750x10' 

0 

-1.70x10^ 

0 

0 

0 

0 

0 

-2/Nx10^°P2 

-P3 

0 

0 

0 

0 

B*  0 

9.095x10^ 

0 

0 


Typical  step  responses  of  the  existing  turret  control  system  described  by  (l) 
for  x^(t)  =  175  milliradians  (10^)  are  as  shown  in  Fig.  3.  The  responses 

V7ere  oscillatory.  For  the  azimuth  channel,  there  was  a  60%  overshoot  in  the 
gun  angular  position  X2(t)  and  the  settling  time  t^  for  xj^(t)  was  miore  than 
tp  =  1  second. 
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(a)  Azimuth  Angular  Position  Error 


TEST  132,  lOnflRBO.  HTC  /  Xni97 

(b)  Elevation  Angular  Position  Error 


Fig.  3  STEP  RESPONSES  OF  ORIGINAL  XM-97 
HELICOPTER  TURRET  CONTROL  SYSTEM 
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3.  GUN-TURRET  MODEL  SIMPLIFICATION 


Tbe  first  step  involved  in  the  design  of  suitable  optimal  controllers  for  the 
XM-97  helicopter  turret  control  system  was  to  obtain  suitable  open-loop  models 
for  the  turret  system.  Hence  the  position  and  velocity  feedback  paths  in  the 
existing  system  indicated  by  the  dotted  lines  in  Fig.  2  were  first  removed;  a 
7th  order  open-loop  turret  control  system  resulted  for  each  of  the  azimuth  and 
elevation  channels.  The  next  step  was  to  simplify  the  7th  order  mathematical 
model.  It  was  decided  to  ignore  stable  poles  of  magnitudes  larger  than  150 
corresponding  to  a  time  constant  of  T  =  1/150  =  6.67  milliseconds.  Hence 

63(5),  05(5)  and  GgCs)  in  Fig.  2  become,  respectively, 

63(8)  =  PI, 

G5(s)  -  7.5, 

GgCs)  =  535. 

Furthermore,  it  is  found  that  the  effect  of  the  shaft  compliance  on  the 
response  of  the  helicopter  gun-turret  drive  could  be  ignored.  The  parameter 
Kc  was  therefore  eliminated  by  setting  it  to  an  arbitrary  large  value  in 
Fig.  2.  The  elimination  of  Kc  in  Fig.  2  has  also  resulted  in  the 
elimination  of  the  geared-down  shaft  angular  position  xy(t)  which  is  not 
accessible  for  on-line  measurement.^  With  all  the  simplifications  as 
discussed  above,  it  can  be  shown  that  the  helicopter  gun-turret  system  shown 
in  Fig.  2  reduces  to  the  open-loop  system  shown  in  Fig.  4  when  the  position 
and  velocity  feedback  paths  are  removed. 

The  dynmaics  of  the  simplified  gun-turret  of  Fig.  4  can  be  shown  to  be 
described  by 


where  x'i(t)  and  x'2(t)  are  the  same  as  in  (1),  J  -  3x10”^  for  the 

azimuth  channel  and  J  =  2.7x10“^  for  the  elevation  channel. 


iHad  xy(t)  not  been  eliminated,  a  suitable  filter  or  observer  would  be 
required  to  generate  an  estimate  of  xj(t)  in  the  implementation  of  an 
optimal  controller. 
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Fig.  4  SIMPLIFIED  OPEN-LOOP  XM-97  HELICOPTER  TURRET  CONTROL  SYSTEMS 
WITH  J=3xlO'’^(AZ)  AND  J=2.7xl0  ^(EL) 


Fig.  5  SIMPLIFIED  OPEN-LOOP  XM-97  HELICOPTER  TURRET  CONTROL  SYSTEMS 

USED  FOR  DESIGN  OF  OPTIMAL  TURRETS  WITH  J=3xlO"'^(AZ)  AND  J=2.7xlO"  (EL) 
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4.  DESIGN  OF  DISCRETE-TIME  OPTIMAL  TURRET  CONTROLLER 


The  design  of  continuous-time  optimal  controllers  for  the  XM-97  helicopter 
gun-turret  control  system  has  been  investigated  in  [1]  and  [3].  In  [1]  and 
[3],  the  design  problem  was  formulated  as  shown  in  Fig.  5,  V7ith  the  dynamics 
of  the  open-loop  system  described  by,  for  both  the  azimuth  and  elevation 
channels, 

x(t)  =  Ax(t)  +  Bu(t)  +  Fv^,  x(0)  =  (3) 

where 

x^(t)  =  fxi(t)  X2(t)], 

X2(t)  =  x^(t)  -  X 

=  error  between  the  position  common  input  x^  (radians)  and  the 
actual  gun-turret  angular  position  x^Ct)  (radians), 

X2(t)  =  Nv^  -  x*2(t) 

=  error  betv^een  the  velocity  command  v^-  (radians/second)  and  the 
actual  motor  angular  velocity  X2(t)  (radians/second), 

Xr^t)  =  x^  +  V|-t 

=  step-plus-ramp  position  command  input  (radians), 
u(t)  =  control  input  (volts), 
and  A,  B  and  F  are  constant  matrices  given  by 


To  design  suitable  m i croprocessor-based  optimal  controllers  for  the  helicopter 
gun-turret  control  system,  equation  (3)  is  first  discretized  by  using  a 
sampling  interval  T  seconds  as  follows: 
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(4a) 


:(k+l)  =  Ac|x(k)  +  Bjju(k)  +  F^v^.,  x(0)  -  Xq’ 


where  x(k)  =  x(kT),  u(k)  =  u(kT), 


A 


AT 


dl2 


0  a 


d22 


®d= 


"d= 


T  At 

Ce  Bdt  == 

*^o 

T  At 

Je  Fdt  = 


'dll 

’dl2 


^dll 

^dl2 


(4c  ) 


(4d) 


and  the  components  in  the  A^,  and  F^  matrices  can  be  computed  easily 
for  each  of  the  azimuth  and  elevation  channels. 

The  design  objective  is  to  drive  x(k)  to  the  zero  state  and  in  the  same  time 
minimizing  a  quadratic  performance  measure.  To  achieve  the  objective,  the 
control  u(k)  is  first  split  into  two  parts  as 


u(k)  =  ufi,(k)  +  uff(k), 


(5) 


where  UfT,(k)  is  the  feedback  component  responsible  for  driving  the  state 
x(k)  to  the  zero  state,  and  uff(k)  is  the  feedforward  component  responsible 
for  accommodating  the  velocity  command  Vj-,  i.e.. 


Using  (3),  (4)  and  (6),  it  is  not  difficult  to  show  that 

f. 


Uff 


(j,)  =  _il  V  =  4.785NxlO"5vr  =  krVf. 

b,2  r 


(6) 


(7) 


Substituting  (5)  and  (7)  into  (4)  yields 

x(k+l)  =  Adx(k)  +  BdUfb(k),  x(0)  =  Xq.  (8) 

Equation  (8)  may  be  written  as,  by  adding  and  substracting  the  same  term 

-B  R“^S^x(k)  on  the  right-band  side  of  the  equation^  [4], 
d  d  d— 


^The  reason  for  using  (9)  is  to  provide  an  alternate  method  for  solving  the 
optimal  control  problem*  See  (12)*“(13)  and  (2o)““(26). 
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(9) 


x(k+l)  =  (A  -  E^R"^sJ)x(k)  +  B^[ujj^(k)  +  R^^sjx(k)] 


=  A  x(k)  +  BjU  (k),  x(0)  =  X  , 
eq  ~  d  eq  •“  — o  " 


where  and  are  constant  matrices,  and 


A 

eq 


A 

=  A  - 


u  (k)  -  u., 
eq  rb 


~1  T 


(k)  +  R^S^x(k). 

G  Q-^ 


(10) 

(11) 


Consider  the  performance  measure 
00 

fx'^CkOO^xCk)  +  2x'^(k) S^u^^(k)  +  u^l^(k)R^u^^(k)  ]  (12) 

k=0 


which  may  be  written  as  [4],  by  completing  the  square  for  the  terms  inside  the 
brackets , 


J 

eq 


=  2  [x^(k)Q  x(k)  + 


u  (k)R  ,u  (k) 
— eq  d— eq 


(13) 


where  J^j  ~  '^eqj 


(14) 


while  defined  in  (11).  In  (12)  and  (13),  is  an  nxn  (n=2 

for  the  present  problem)  symmetric  posit ive-semidefinite  weighting  matrix, 
R^  is  an  rxr  (r=l  for  the  present  problem)  symmetric  positive-definite 
weighting  matrix,  and  is  an  nxr  (n=2,  r=l)  weighting  matrix.  In  the 

design  of  an  optimal  control  system,  the  numerical  values  of  the  elements  of 
the  weighting  matrices  Q^j ,  R^j  and  S^j  are  often  chosen  in  a  trial  ana 
error  basis,  guided  by,  perhaps,  the  physics  of  the  problem  and  design 
experience.  Hov^ever,  it  is  important  that  these  weighting  matrices  be  chosen 
such  that  [4]  -  [5], 


(i)  [A  , 
eq 


where 


d'd 

d  d 


'd®3- 


is  completely  observable,  i.e.. 


rank[D 


T  T 
eq  d 


lA^Vl 

‘  eq  d 


.(n-l)T^T,  _ 
eq  d 


(15) 


(ii)  [Agq,  is  completely  controllable,  i.e.. 


n. 


(16) 


The  resultant  optimal  closed-loop  control  system  will  then  be  asymptically 
stable  if  (condition  (i))  and  only  if  (condition  (ii))  conditions  (i)  and  (ii) 
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are  satisfied. 


Given  (8)  and  (12),  or  equivalently  (9)  and  (13),  we  wish  to  find  the  optimal 
control  which  minimizes  the  performance  measure  >  or  equivalently 
It  is  well  known  that  the  optimal  control  problem  posed  by  (8)  and  (12),  or 
equivalently  (9)  and  (13),  is  a  discrete-time  version  of  the  following 
continuous-time  optimal  control  problem: 

Given  the  continuous-time  system. 


x(t)  =  Ax(t)  +  Bu(t),  21^^)  ~  21o 

find  the  optimal  control  which  minimizes  the  performance  measure 

J  =  f  [x'^(t)Qx(t)  +  u^(t)Ru(t)  ]dt ,  (18) 

0  ” 

where  Q  and  R  are,  respectively,  symmetric  positive-semidef inite  and  symmetric 
positive-definite  weighting  matrices.  The  continuous-time  system  described  by 
(17)  may  be  discretized  as  shown  in  (4).  The  corresponding  discrete-tim^e 
version  of  (18)  is  then  given  by  (12)  with  the  following  substitutions  [6]: 


T  T 

Q  =re  Qe  dt, 


(19) 


=  f  [B  +  B^(t)QB^(t)]dt, 

^  r\ 


(20) 


T  T 

Sj  =  J*  e  QB(t)dt , 


(21) 


B(t)  =  fe^^BdO 


(22) 


We  observe  that  although  there  is  no  cross-product  term  of  the  form 
x'^(t)Su(t)  used  in  (18),  the  cross-product  term  due  to  in  (12)  is 

generally  non— zero,  unless  Q  =  0  as  may  be  seen  from  (21).  We  remark  also 
that  the  cross-product  term  in  (12)  may  be  dropped  if  one  is  not  interested  in 
establishing  the  corresponding  relationships  between  the  continuous-time  and 
the  associated  discrete-time  optimal  control  problems. 

Now,  the  optimal  control  which  minimizes  the  performance  measure  given  by 
(12)  subject  to  (8)  is  given  by  [5] 

.  -(Ej  +  *  s’)i(k)  <23a) 

=  k^Xj^(k)  +  (2;3b) 
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(23c) 


=  k^[x^(k)  -  x^(k)]  +  k^fNv^  -  x^Ck)], 


where  kj  and  k2  are  constants,  and  is  the  s)nnTnetric  positive-definite 
solution  of  the  algebraic  matrix  Riccati  equation 


'  ^d'^d^d  *  ‘^d 


-(bJk  A  +  sJ’)’^(R  +  bJk  B  )"’^(bVa  +  sT) 
ddd  d  d  ddd  ddd  d 


(24a) 


=  A  +  Q 

eq  d  d  eq 

-A^  K,B.(R,  +  BjK,B,)"^bVa  . 
eqdd  d  ddd  ddeq 


(24b) 


Equation  (24b)  follows  from  (24a)  by  simple  algebraic  manipulations. 


Similarly,  the  optimal  control  which  minimizes  Jgq  given  by  (13)  subject  to 
(9)  is  given  by 


where  is  the  symmetric  positive-definite  solution  of  (24).  As  expected, 
substitution  of  (25)  into  (11)  yields  (23). 

Combing  (7)  and  (23),  the  total  control  is  given  by 


u(k)  =  -(R,  +  B^K.B  ,)”^(BjK>.  +  S^)x(k)  +  k  v  (26a) 

d  ddd  ddd  d—  rr 

“  kixi(k)  +  k2X2(k)  +  k^-v^.  (26b) 

Substituting  (26)  into  (4a)  yields  the  optimal  helicopter  gun-turret  control 
system 

x(k+l)  =  [A  -  B.(R,  +  bVb,)~^(bVa,  +  S^)]x(k),  x(0)  =  x  .  (27) 

—  d  dd  ddd  ddd  d—  —  — o 

A  block  diagram  for  a  microprocessor-based  implementation  of  the  optimal 
controller  given  by  (26)  is  as  shown  in  Fig.  6. 
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Fig.  6  OPTIMAL  XM-97  HELICOPTER  TURRET  CONTROL  SYSTEMS-  AZIMUTH  AND  ELEVATION  CHANNELS 


5.  IMPLEMENTATION  OF  DISCRETE-TIME  OPTIMAL  CONTROLLER 

The  hardware  used  to  execute  the  discrete-time  optimal  controllers  for  both 
the  azimuth  and  elevation  channels  developed  for  the  »i-97  helicopter  turret 
control  system  developed  in  Section  3  is  as  shown  in  Fig.  7.  It  consists  of 
two  SBC  310  high  speed  mathematics  boards,  an  SBC  86/12  (16  bits,  8086  based) 
single  board  computer  and  a  SBC  732  combination  A/D  and  D/A  board  (12  bits). 
A  TI  Silent  700  provides  the  capability  for  quick  parameter  adjustment  of  the 
controller  algorithms  during  nonfiring  and  firing  tests.  We  remark  that  only 
one  SBC  310  high  speed  mathematics  board  is  needed  to  execute  the  control 
algorithms.  The  second  SBC  310  mathematics  board  is  used  as  a  back-up  and 
also  to  provide  additional  computation  capability  for  follow-on  development. 
To  implement  the  controllers  without  disturbance  cancellation,  only  the  SBC 
86/12  single  board  computer  is  needed.  One  complete  iteration  of  the  algorithm 
requires  0.8  milliseconds  when  implemented  using  fixed  point  assembly  language 
code.  This  rapid  execution  time  makes  it  possible  to  test  and  compare 
discrete-time  optimal  controllers  for  sample  intervals  of  T=1  millisecond  and 

T-10  milliseconds.  The  nonfiring  and  firing  test  results  will  be  presented  in 
the  next  section. 


The  electronic  hardware  is  housed  in  a  9"  x  15"  x  16"  aluminium  case  and 
weighs  approximately  30  pounds.  One  of  the  design  requirements  for  the  system 
IS  that  it  be  fully  transportable.  This  has  been  made  necessary  by  the  fact 
that  the  software  development  facility  is  located  at  the  Dover,  New  Jersey 

site  and  the  nonfiring  and  firing  testing  facility  is  located  in  Rock  Island, 
Illinois.  * 


digital  turret  control  system,  an  initialization 
outine  INIT  initializes  all  programmable  devices  and  transfers  all  programs 

from  PROM  to  RAFJ  memory.  The  desired  program  is  then  called  by  executing  an 
appropriate  interrupt.  ^  ® 

A  flow  diag^ram  of  the  digital  optimal  controller  software  is  as  shown  in  Fig. 
8.  The  subroutine  INPAZ  brings  in  and  scales  the  azimuth  position  error, 
turret  rate  and  sight  rate  signals  and  compute  rate  error  xo(k).  The 
subroutine  INPEL  performs  a  similar  function  for  the  elevation  channel.  The 
subroutine  DAOT  outputs  the  calculated  control  signals  u(k)  for  both  the 
azimuth  and  elevation  channels  to  the  D/A  converters.  Before  the  signal  u(k) 
is  sent  to  the  D/A  converter,  however,  it  is  converted  to  fixed  point  and 
clipped  at  12  bits  (+10  volts)  to  prevent  saturation  of  the  D/A.  In  order  to 
provide  maximum  flexibility  in  software  development  and  implementation, 
extensive  use  is  made  of  three  macro  routines  COMP,  GET  and  FIN.  The  MACRO 
C0MP(WT,  DATAl,  DATA2,  OPCODE,  TMP)  functions  as  follows.  If  WT  is  other  than 
null,  the  program  checks  to  determine  if  the  high  speed  mathematics  board  has 
completed  its  computation  before  continuing.  DATAl  and  DATA2  are  the  address 
routine  for  the  4  byte  operator  and  operand,  respectively;  if  either  is  a 
null,  the  old  values  is  used.  OPCODE  is  a  hexidecimal  number  from  0  to  F 
indicating  the  type  of  operation  to  be  performed.  If  this  value  is  a  null 
then  no  operation  takes  place.  If  TMP  is  a  null,  the  MACRO  is  terminated 
after  the  high  speed  mathematics  board  is  started.  Otherwise  the  program 

is  completed.  The  4-byte  result  is  then  stored  in 
the  address  pointed  to  by  TMP.  The  MACROS  GET(WT,  TMP)  and  FIN(WT)  are 
actually  subsets  of  the  MACRO  C0MP(WT,  DATAl,  DATA2,  OPCODE,  TWP).  A  complete 
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Fig.  7  XM-97  DIGITAL  CONTROLLER  HARDWARE 


216 


INIT 

Initialize  All  Variables  to  Zero 


Set  Timer 


I 


INPAZ 

3 


DUAZ  =  C1AZ»E^Z  +  C2AZ*RTAZ  i 


IWAZ  =  ZAZ  +  C3AZ*ERAZ  +  C4AZ*RTAZ  I 


DUAZ  =  DUAZ  +  C5AZ*WAZ 


Fig.  8  XM-97  OPTIMAL  CONTROLLER  FLOW  CHART 

(INCLUDING  DISTURBANCE  CANCELLATING  CONTROL) 
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listing  of  the  software  program  is  available  upon  request. 

The  microprocessor  development  system  used  to  debug  the  optimal  controller 
algorithms  is  as  shown  in  Fig.  9.  The  facility  is  located  at  the  Army 
Armament  Research  and  Development  Command  laboratory  in  Dover,  New  Jersey. 
The  entire  system  consists  of  a  basic  INTEL  220  development  system  expanded  to 
64K  bytes  RAN  and  1.25  megabytes  disc.  In  addition,  there  is  a  high  speed 
line  printer,  an  8“’channel  12-bit  A/D,  a  4-channel  12-bit  D/A  and  a  PROM 
programmer  for  the  2708,  2716  and  2732  PROMS  (8,  16  and  32  bits  per  unit, 
respectively).  Since  the  8080  requires  1.2  ms  to  perform  a  32Kbit  floating 
point  multiplication,  a  means  of  speeding  up  computation  is  required  for  real 
time  execution  of  algorithms.  Therefore,  a  SBC  310  high-speed  mathematics 
board  is  added  which  does  the  same  multiplication  in  85  microseconds. 
However,  this  board  must  communicate  with  the  CPU  via  the  system  bus  in  order 
to  store  and  then  load  the  required  four  byte  data  words.  This  process 
requires  approximately  90  microseconds.  To  minimize  this  excess  overhead 
time,  another  SBC  310  high-speed  mathematics  board  was  added  which  permits  one 
board  to  compute  while  the  other  is  storing  data. 


Fig.  9  MICROPROCESSOR  DEVELOPMENT  SYSTEM 
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6.  TEST  RESULTS 


Firing  and  nonfiring  tests  of  the  microprocessor  controlled  XM-97  helicopter 
turret  control  system  v?ere  conducted  using  the  following  values  of  Q  and  R  in 
(18),  \ 


0  0 


The  corresponding  Q^,  R^  and  S^  were  computed  according  to  (19),  (20)  and  (21), 
respectively.  The  above  choice  of  Q  implied  that  only  x^(t)  =  x^(t)  -  x|(t) 
was  weighted  so  that  large  amplitudes  of  xi^](t)  were  discouraged. 

The  optimal  control  used  was  given  by 

u(k)  =  kix(k)  +  k2X2(k)  +  k^v^. 

The  following  cases  were  investigated: 


‘>11 

‘>11 


=  5, 

=  10, 

=  15. 


using  a  sampling  intervals  of  T=l  millisecond  and  T=10  milliseconds.  It  was 
found  that  the  case  of  and  a  sampling  interval  of  T=1  millisecond  gave 

the  best  results  for  both  the  firing  and  nonfiring  tests. 

(a)  Nonfiring  Tests 

For  nonfiring  tests,  the  step  responses  of  the  azimuth  channel  of  the  original 
turret  and  the  optimal  digital  turret  are  as  shown  in  Fig.  10(a)  ^  (b)  and 
Fig.  11(a)  -  (b) .  The  step  Inputs  used  were: 

( i )  Original  Turret 

Xi.(t)  =  2°, 

Xr(t)  =  50. 

( i i)  Optimal  Digital  Turret 

Xr(t)  =  2°. 

The  statistics  of  the  step  responses  shown  in  Figs.  10  ~  11  are  summarized  in 
Table  1.  The  ^step  responses  of  the  original  turret  exhibited  an  average 
overshoot  of  50%  while  there  was  no  overshoot  for  the  optimal  digital  turret. 
The  settling  time  of  the  original  turret  was  about  4.4  times  longer  than  that 
of  the  optimal  turret. 
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TABLE  1:  STEP  RESPONSES  OF  ORIGINAL  AND  OPTIMAL 
TURRETS-AZIMUTH  CHANNEL 


1 

TURRET 

SETTLING 

TIME  (SEC) 

%  OVERSHOOT 

COMMENTS 

Original 

Average  of 

2®  Step 

49% 

left  &  right 

5^  Step 

0.74 

51% 

excursions 

Optimal 

Smooth 

qii=5, 

0.14 

None 

response 

T=1  ins 

Optimal 

Slight 

0.14 

None 

chattering 

T=10  ms 

response 
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X^(t)  -  xj(t) 


DEnOOULflTEO  ftZinUTH  ERROR  -  FILTER  =*  lOOHZ 
TEST  012.  04QCT79.  ROD  TURRET  CONTROLLER 


(a)  STEP  RESPONSE  -  AZIMUTH  CHANNEL 


TIME  IN  SECONDS  X  10+1 
CH  A  AZMITH  ERROR  -  DEMODULATED  X  10-t  1 
TEST  66,  13flPR78,  5  DEG  AT  5H2 


(b)  STEP  RESPONSE  -  AZIMUTH  CHANNEL 


Fig.  10  STEP  RESPONSES  OF  ORIGINAL  XM-97  GUN-TURRET 
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x^(t)-x|(t) 


TIME  IH  SECONDS  X  10+1 
AZ  DEHOO  ERROR  -  DTC  FILT-100H2  X  10*a 
TEST  02  /  OTC/  eSJULSl 

(a)  STEP  RESPONSE  -  AZIMUTH  CHANNEL 


(b)  STEP  RESPONSE  -  AZIMUTH  CHANNEL 

Fig.  11  STEP  RESPONSES  OF  OPTIMAL  XM-97  GUN-TURRET 
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(b)  Firing  Tests 


The  firing  tests  were  conducted  using  20  rounds  burst  fire*  The  firing  rate 
was  600  rounds  per  minute.  In  order  to  ignore  the  transient  response  of  the 
turret,  the  statistics  of  the  first  6  rounds,  corresponding  to  approximately 
0*6  second  of  firing  time^,  were  neglected*  The  performance  of  the 
original  turret  and  the  optimal  digital  turret  in  terms  of  the  standard 
deviations  of  their  rounds  dispersion  are  summarized  in  Table  2*  A 
performance  improvement  of  approcimately  2  to  1  was  obtained*  Further  tests 
using  different  values  for  and  using  disturbance  cancellating  optimal 

digital  controllers  are  currently  underway*  The  results  will  be  reported 
elsewhere* 


TABLE  2:  STANDARD  DEVIATIONS  FOR  RANGE  OF  85  FEET 


OVERALL 
AZIMUTH  (INCH) 

BARREL  1 
AZIMUTH  (INCH) 

BARREL  2 
AZIMUTH  (INCH) 

BARREL  3 
AZIMUTH  (INCH) 

D  30 

1.25 

0.70 

1.15 

1.76 

D  31 

1.78 

1.17 

0.80 

2.42 

0  33 

2.11 

1.35 

2.23 

0.99 

D  36 

1.57 

1.47 

1.46 

1.53 

D  37 

1.87 

1.70 

1.14 

1.38 

0  38 

2.26 

2.28 

1.93 

1.80 

D  =  Optimal  Digital  Turret 
0  =  Original  Turret 


^The  average  settling  time  of  the  original  turret  for  nonfiring  tests  was 
about  0*62  seconds;  see  Table  1* 
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DISCRETE-TIME  DISTURBANCE-ACCOMMODATING  CONTROL  THEORY; 
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INTRODUCTION 


In  a  recent  paper  [1],  the  theory  of  Disturbance-Accommodating  Control  (DAC) 
was  extended  to  include  the  case  of  discrete-time,  sampled-data  control  prob¬ 
lems.  The  results  presented  in  [1]  covered  the  disturbance  absorption  (can¬ 
cellation,  rejection,  counteraction)  and  disturbance-minimization  modes  of 
control,  but  did  not  consider  the  disturbance-utilization  mode  of  control. 

In  the  present  paper,  we  complete  the  development  begun  in  [1]  by  deriving 
a  rather  general  theory  for  the  disturbance-utilization  mode  of  digital  DAC 
for  the  case  of  linear  plants  with  a  quadratic  performance  index. 

The  motivations  for  a  theory  of  digital  DAC,  and  the  detailed  derivation  of 
the  basic  discrete-time  models  for  the  plant  and  disturbances  are  well- 
documented  in  [1]  and  therefore  will  not  be  repeated  here. 


SUMMARY  OF  DISCRETE-TIME  MODELS  FOR  LINEAR  PLANTS, 
DISTURBANCES  AND  COMMANDS 


The  class  of  plants  and  disturbances  considered  here  are  assumed  to  be  such 
that  their  discrete-time  response  can  be  modeled  by  the  following  linear 
difference  equations  (see  [1]  for  the  derivation  of  these  difference 
equations  from  their  continuous-time  counterparts) 


Ex(nT)=A(nT)x(nTj+B(nT)u(nT)  +FH(nTjz(nT)  +  Y(nT)  da) 
y(nT)=C(nT)x(nt) 

external  disturbance  vector  =  |-|(nTj^(nT) 

-  ^  (Id) 

E  ^(r\T)  =I)(nT)  +  (J'fnT) 


Some  possible  generalizations  of  Eqs.  (lb)  and  (lc,d)  are  described  in  [1]. 
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where  E(-)  denotes  the  forward  shift  operator:  Ex(nT)  =  x((n+l)T)  etc., 

X  =  (xi . xn)  is  the  plant  state-vector,  y  =  (yi ,  •••>  ytn)  the 

plant  output  vector,  u  =  (u] ,  ...,  u^.)  is  the  plant  control  input, 

-w  =  (Vi,  ...,Wn)  is  the  plant  disturbance  input,  and  z  =  (fl.  •••;  ZgJ 
notes  the  "state-vector"  for  the  disturbance  process  (Ic),  (Id).  In  the 
case  of  plants  and/or  disturbance  processes  which  are  governed  by  under¬ 
lying  linear  continuous-time  differential  equations  of  the  form 


X  =A(i}x  +B(-1:)U+  Fft)v(W 
y=C(i)x 

V  =  nm 


(2a) 

(2b) 

(2c) 

(2d) 


where  a  =  (oi,  ap)  denotes  an  unknown,  sparse  sequence  of  randomly 
arriving  (once-in-a-while), random  intensity  imoul^es  having  a  specified 
minimal  adjacent  spacing  >  0,  the  matrices  A,  etc.  in  Eq.  (1)  are 
related  to  their  counterparts  in  Eq.  (2)  as  follows  (see  LU) 


=  state-transition  matrix  for 


B(nTj=/  ^(n+/JJ^r)B(rJc/t 
F  H (nT)  7  Mmi)Tjr)F{r)hft)f  k 


f(nT)  -  it) f  §JrJ)(r(^Jc/!^]cl'£ 

W  ifnT 

rtHnn)? 


(3a) 

(3b) 

(3c) 

(3d) 

(3e) 

(3f) 


I  r  A7/ 

where  $n  denotes*^the  state  transition  matrix  for  D(t).  Note  thal 
turbance-like  terms  y.  o  in  Eq.  (1)  [called  "residuals"  in  [1]] 


Note  that  the  dis- 
account 

for  the  action  of  the  a(t)  impulses  which  arrive  between  adjacent  sampling 
times  to  +  nT,  to  +  (n+l)T.  Since  the  arrival  times  and  intensities  of  the 
sparse  cr(t)  impulses  are  completely  unknown,  (no  probabilistic  stucture)  the 
terms  y,  a  are  also  completely  unknown. 


The  primary  objective  of  control  is  assumed  to  be  expressible  as  the 
point  regulation  of,  or  servo-tracking  by,  certain  specified  plant  variables 
(yi>  y?*  •••»  ym)  >  where  in  general  the  y-j  can  be  related  to  the  plant  state- 
variables  (x^,  ...,  x^)  by  the  linear  expression 


Note  that  the  argument  symbol  nT  appearing  on  the  left  side  of  Eq.  (3) 
actually  denotes  the  time  t  =  to  +  nT.  This  shorthand  notation  will  be 
used  consistently  in  this  paper. 
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(4) 


y=Cli)x-  y=(yi/-%yffi) 

In  many  cases,  the  objective  is_to  control  the  plant  output  y(t)  in  Eq.  (lb), 
in  which  case  one  would  choose  C  =  C  in  Eq.  (4).  On  the  other  hand,  the 
desire  to  control  the  entire  plant  state  x(t)  would  be  indicated  by  chtos.-c, 

C  =  I  in  Eq.  (4). 

In  accordance  with  standard  procedures  in  DAC  design  [2],  the  desired 
Commanded)  behavior  of  y(t)  is  expressed  in  continuous-time  by  the  "set¬ 
point/servo-command"  dynamical  model 

y(0  =yp=Gii)c(i)  (5a) 

desired  _ 

C  =Eii)c+^Ji)  (5b) 

where  {G(t),  E(t)}  are  determined  a  priori  by  appropriate  "command  modeling" 
procedures;  see  [2;  p.  642];  and  where  c  =  (c],  ...,  c^)  represents  the 
"state"  of  the  command  model*  Eq.  (5).  The  vector].^  =  ..., 

sents  a  sequence  of  totally  unknown  impulses  which  are  sparse;  similar  in 
nature  to  the  a(t)  im£ulses  in  Eq.  (2d).  It  is  assumed  that  the  set-point/ 

servo-command  vector  y^  =  (yci . 7cj^)  might  not  be  known  a  priori,  but 

can  be  directly  and  accurately  measured  on-line,  in  real-time. 

In  the  case  of  set-point  regulation  problems  th_e  command  yc(t)  is  essentially 
constant,  or  piecewise  constant,  in  which  case  Ejt)  e  0  in  Eq.  (5b)  and  one 
can  then  set  G(t)  e  I  in  Eq.  (5a),  assuming  the  y^  iji  Eq.  (4)  are  independent. 
In  the  case  of  servo-tracking_problems ,  the  command  yc(t)  is  allowed  to  con¬ 
tinuously  vary  with  time  and  E(t)  is  chosen  accordingly;  see  [2;  p.  642]. 

For  purposes  of  designing  discrete-time  controllers,  it  is  necessary  to  have 
a  discrete-time  version  of  the  set-point/servo-command  model  Eq.  (5). 
Following  the  same  procedure  used  for  Eq.  (1),  the  discrete-time  model  of 
Eq.  (5)  is  obtained  as 

y^(nT)=G(nT)c(nT);  c(MT)=E(nTjc(hTj+;i/r.T)  (6) 

where 

E(nT)-$_^+(n+l)T)i+nT^=  transition  matrix  for  E(t)  (7a) 

M  =  f  °  (7b) 

The  information  embodied  in  the  real-time  command-state  c(nT)  enables  the 
DAC  controller  to  "accommodate"  uncertain  servo-command  behavior 
similar  to  the  way  disturbances  are  accommodated. 

*The  symbols  c,  f  in  Eq.  (5b)  correspond  to  the  symbols  x  ,  R  used  in 
[1;  Eq.(43)]. 
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DESIGN  FOR  THE  DISTURBANCE-UTILIZATION  MODE 


The  two  modes  of  disturbance  accommodation  considered  in  [1]  were  designed 
to  cope  with  disturbances  by  counteracting  (absorbing)  or  minimizing  their 
effects.  This  design  attitude  reflects  the  traditional  view  of  disturbances 
as  causing  only  unwanted,  disruptive  effects  on  the  plant  behavior.  However, 
there  are  realistic  situations  in  which  disturbances  are  capable  of  pro- 
ducing  desirable  effects  on  the  plant  behavior.  In  particular,  it  is  possible 
that  at  least  some  of  the  action  of  disturbances  can  be  constructively  used 
to  assist  the  controller  in  accomplishing  the  primary  control  task.  The 
trick,  of  course,  is  to  know  just  how  to  manipulate  the  control  u(nT),  in 
real-time,  so  as  to  harness  and  exploit  any  useful  effects  inherent  in  the 
(uncertain)  disturbance  actions. 


The  systemmatic  design  of  continuous-time  controllers  to  optimally  utilize 
the  action  of  uncertain  disturbances  was  first  introduced  in  [3],  and  has 
since  been  refined  and  applied  in  [2],  [4],  [5],  [6],  [7]-_  Ifi 
we  will  derive  discrete-time  versions  of  "disturbance-utilizing  controller 
design  procedures  which  parallel  the  results  in  [2],  [4],  and  [6J. 

THE  CHOICE  OF  A  PERFORMANCE  INDEX  T  IN  DISTURBANCE-UTILIZING  CONTROL  PROBLEMS 


The  objective  of  disturbance-utilizing  control  is  to  make  maximum  (optimal) 
use  of  the  disturbance  w(t)  as  an  aid  in  accomplishing  the  primary  control 
task.  For  instance,  if  the  primary  control  task  is  to  achieve  set-point 
regulation  or  servo-tracking  with  minimal  expediture  of  control  resources 
(fuel,  energy,  etc.),  it  is  conceivable  that  the  action  of  disturbances  w(t) 
might  be  able  to  reduce  the  drain  on  control  energy  pd/or  achieve  "better 
set-point  regulation  or  servo-tracking  —  if  u(nT)  is  manipulated  properly. 
On  the  other  hand,  if  the  disturbance  actions  are  such  that  they  are  totally 
counter-productive  to  the  primary  control  task,  the  use  of  an  optimal  dis¬ 
turbance-utilizing  controller  will  serve  to  minimize  the  inevitable  loss  of 
performance  contributed  by  the  disturbance. 


The  optimal  utilization  of  disturbances  is  achieved  by  application  of  optimal 
control  theory,  where  the  per forma nee- index  functional  J  is  structured  such 
that  the  minimization  of  J  by  u(nT)  achieves  the  primary  control  task  while 
simultaneously  making  maximum  "use"  of  w(t).  In  the  continuous-time  version 
of  disturbance-utilizing  control  theory  [4]  the  most  common  choice  of  per¬ 
formance  index  J  for  set-point  and  servo-tracking  problems  is  the  classical 
error/control  quadratic  functional 

where  £(t)  denotes  the  instantaneous  "con^trol  error";  i 
desired  response  y-  and  actual  response  y(t);  and  S,  Q,  -  r--_ 
definite  symmetric  matrices  chosen  by  the  designer.  The  design  of  uu;  to 
minimize  Eq.  (8)  automatically  achieves  the  primary  control  task  of 


(8) 

. e.  the  error  between 
R  are  positive 


M£(T.)||  =  "small"  [and  ||e(t)||  =  "small"],  while  simultaneously  letting 
w(t)  ^"assist"  in  that  task  and/or  in  (possibly)  reducing  control  resource 
consumption  as  measured  by  the  time-integral  of  u ' (t)R(t)u(t) .  If  the  dis¬ 
turbance  w(t)  is  such  that  it  cannot  "assist"  in  reducing  J  in  Eq.  (8),  the 
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control  u(t)  which  minimizes  Eq.  (8)  will  then  automatically  minimize  any  per¬ 
formance  deterioration  (increase  in  J)  which  w(t)  contribues. 

In  discrete-time  optimal  control  theory,  the  most  common  discrete-time 
version  of  Eq.  (8)  is  expressed  as 

n*(N-) 

n-0 

where  the  interval  of  control  [tQ,  T^]  is  divided  into  N  equal  segments 
tn  =  t^+nT;  n  =  0,  1,  2,  ...  N.  Actually,  if  one  evaluates  the  continuous¬ 
time  performance  index  Eq.  (8)  over  each  segment  t^+nT  ^  t  <  t^  +  (n+l)T, 
using  the  known  solution  expression  for  Eqs.  (2),  (5),  it  can  be  shown  [8] 
that  Eq.  (8)  may  finally  be  expressed  in  the  form^  of  Eq.  (9)  with  the  ex¬ 
ception  that  there  is  an  additional  term  2  uT{nT)M(nT)e(nT]^  in  the  summation 
on  the  right  side  of  Eq.  (9).  In  that  case,  the  matrices  Q,  M,  R  are  re¬ 
lated  to  Q,  R,  B,  A  in  Eqs.  (8),  (2)  through  some  rather  involved  integrals. 

In  practical  applications  of  discrete-time  optimal  control  it  is  generally 
preferable  to  adopt  the  format  Eq.  (9)  as  the  starting  point  for  structuring 
the  performance  index  J  and  then  design  the  weighting  matrices  S,  Q,  R  in 
Eq.  (9)  to  attach  proper  emphasis  on  the  minimization  of  e(nT),  u(nT); 
n  =  0,  1,  2,  ...,  N.  For^  this  reason  we  hereafter  adopt  Eq.  (9)  as  the 
basic  performance  index  J  for  the  design  of  discrete-time  disturbance¬ 
utilizing  controllers  for  set-point  and  servo-tracking  problems. 

FORMULATION  OF  A  GENERAL  CLASS  OF  DISCRETE-TIME  DISTURBANCE-UTILIZING 
CONTROL  PROBLEMS 

The  systemmatic  design  of  disturbance-utilizing  controllers  can  be  achieved 
by  formulating  the  problem  as  a  conventional  (undisturbed)  linear-quadratic 
discrete-time  control  problem  for  which  solution  algorithms  are  known.  For 
this  purpose,  the  discrete-time  models  Eqs.  (1),  (6),  (7)  are  consolidated 
into  one  composite  "plant"  model  and  written  as 

(Ex(nT)]  [AlOlFHl/xffll  [B.]«W/i 
EcfnT)  =  O  E  Q  cW+  O  +A 

»  [dJ  \? 

x(nTi=Moiq|g] 

izWj 

*Note  that  the  arguments  NT,  nT  in  Eq.  (9)  actually  represent  the  times 
t  =  tg+NT,  t  =  tg+nT.  This  shorthand  notation  is  consistent  with  that  used 
in  Eqs.  (1)»  (6),  (7)  etc.;  see  footnote  associated  with  Eq.  (3). 


(10a) 

(10b) 


For  simplicity,  the  model  Eq.  (10a)  is  written  in  the  more  compact  form 

Ex=A(nT)xffl+B(nT)um+^(nT];  x=fx|ck)‘'  <") 

where  the  meanings  of  A,  B,  5,  are  clear  from  examination  of  Eq.  (10a). 

The  instantaneous  control  error  e(t)  in  Eq.  (8)  is  th^  difference  between 
the  desired  response  and  the  actual  response.  Since  yc(t)  and  y(t)  represent 
those  two  responses,  we  write  e(t)  as 


eW  =  yc®“y^t) 


or,  in  terms  of  discrete-time  t  =  t^  +  nT 


Using  Eqs.  (4),  (6),  expression  (13)  may  be  expressed  in  terms  of  x  as 

e=[-C|&|0]x=Cx:  ;  C=['C1G-|0]  ^  04) 

Now,  the  quadratic  forms  in  the  .discrete-time  performance-index  J  in  Eq.  (9) 
may  be  expressed  as  *  .t  a 

-r/V^A/  A  A  A*  A/  A 

/5€  =  x  CSCx=x5x  ;  S=C5C  nsa) 


^Qe  =xQx  ;  Q-CQC 


(15b) 


Using  Eq.  (15),  J  in  Eq.  (9)  may  finally  be  expressed  in  terms  of  the  com- 
posite  state  x  as  follows 

(NTjS  +1:  A  ^fnT)0(nT)^  W  ^  ^  ®  ^ 

n=o 

The  optimal  disturbance-utilizing  control  problem  for  discrete-time  set-point 
regulation  and  servo-tracking  may  now  be  expressed  precisely  as  follows. 

Find  the  control  sequence  u(nT)  =  u°(nT),  n  =  0,  1,  2,  (N-1).  which 

minimizes  the  performance  index  Eq.  (16)  subject  to  the  difference  equation 
constraint  Eqs.  (10),  (11)  and  for  arbitrary  initial  conditions  {x(0)^  c(0) 
z(0)}  =  x(0).  Since  the  a(t),  impulses  which  create  the  terms  y,  Pc*  ^ 

in  Eq.  (10a)  are  completely  unknown  and  sparse,  we  will  follow  standard 
procedure  in  DAC  theory  and  disregard  the  presence  of  those  terms  in  Eq. 
(10a);  see  remarks  in  [2;  p.  639], 

SOLUTION  OF  THE  DISCRETE-TIME  DISTURBANCE-UTILIZING  CONTROL  PROBLEM 

The  minimization  of  (16)  subject  to  Eqs.  (10),  (11)  has  the  form  of  the 
conventional  (undisturbed)  discrete-time  linear  quadratic  regulator  pr(3blem 
which  has  already  been  solved;  see  for  instance  [8].  That  known  solution, 
when  applied  to  the  specific  plant  Eqs.  (10),  (11)  and  performance  index 
Eq.  (16)  leads  to  the  following  expressions  for  the  optimal  di screte-ti^me 
disturbance-utilizing  control  u°(nT),  n  =  0,  1,  ...,  (N-l)-  Assuming  x(nT) 
can  be  directly  measured,  the  optimal  control  u°(nT)  is  given  by 
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(17) 


where  the  matrix  P(*)  is  symmetric,  positive  definite,  and  governed  by  the 


P(N1T)=S 


(18a) 

(18b) 


Note  that  the  Riccati  difference  equation  Eq. (18)  is  automatically  set-up 
for  backward-time  solution,  "starting"  at  t  =  Tf  =  tp  +  NT  and  progressing 
backward:  t  =  tg  +  (N-l)T,  t  =  tg  +  (N-2)T,  ...,t  =  to+T,t  =  t^j.  In 
other  words,  one  successively  sets  n  =  (N-l),  (N-2),  (N-3)...,  1,  0  in 
Eq.  (18a).  The  resulting  sequence  of  values  F(nT)  is  then  stored  for  future 
playback  in  the  forward-time  control  law  expression  Eq.  07).  In  particular, 
at  each  time  t  =  tp  +  nT  the  "current"  values  of  R(nT),  B(nT),  7r(nT)  are  sub¬ 
stituted  into  Eq.  (17),  together  with  the  "one-step-ahead"  value  of 
K(n+1)T),  to  compute  the  overall  state-feedback  gain  matrix  JfC(nT,(n+l )T) 
defined  by 


(19a) 


and  uO(nT)  is  then  computed  as 


(19b) 


In  practical  applications,  the  composite  state  term  x  =  (x|c|z)  iji  Eq.  (j[9b) 
cannot  be  directly  measured  and  therefore  must  be  implemented  as  x  =  (xlcjz) 
where  x,  c,  z  are  estimates  of  x,  c,  z  generated  on-line,  in  real  time  by  a 
discrete-time  state  observer  such  as  described  in  [1;  Eqs.  (18)-(25)  and  end 
of  Section  ‘ST-D].  Note  that  the  computation  of  IIC(nT)  can  be  done  off-line 
(it  doesn't  depend  on  knowledqe  of  x(nT))  and  therefore  the  time  required 
for  accurate  calculation  of  K.  does  not  impact  on  the  real-time  performance 
of  the  controller. 


The  form  of  Eqs.  (17),  (18),  (19)  does  not  yield  much  insight  into  the  fine 
structure  of  the  optimal  disturbance-utilizing  control  u°(nT).  To  see  that 
fine  structure,  it  is  necessary  to  decompose  ?■(•)  into  smaller  blocks  cor¬ 
responding  to  the  block  sturcture  of  TT  in  Eqs.  (10),  (11).  For  that  purpose, 
we  set 


P  = 


1  _ 

K..' 

C 

Kc 

K" 

iK 

Cf 

^  J 

K„  =nxn  :  K,  =  nx-j7.  K  =nx(0 

X  ^  XC  ^  Xi  \ 


(20) 
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and  substitute  Eq.  (20),  together  with  the  expressions  for  R,  B,  A,  Q  and  S, 
into  Eqs.  (17),  (18),  (19)  to  obtain  the  optimal  control  u  (nT)  expressed 
equivalently  as  r 

u°(hT)=-[KW+B«iT)I^NT)'B(nT)]  B(hT5[l^«f  A(nT)x(nT)+l^{(wifr)Mc6'T)+ 

+(l^((n«jr)pHlhT)+iysn#)i5(nT))flnT)^ 

where  the  six  block  matrices  comprising  Eq.  (20)  obey  the  following  set  of 
coupled  matric  difference  equations 


K^NT)=0  ^22c) 
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must  first  solve  for  Kx(nT),  K  (nT),  K  (nT)  by  solving  Eqs.  (22a, b,c)  in 
backward  time  n  =  (N-1),  (N-2),...,  2,  T,  0,  using  the  indicated  "initial 
conditions"  at  t  =  tg  +  NT.  As  already  mentioned,  this  step  can  be  carried- 
out  off-line  (ahead  of  time)  and  the  computed  values  stored  for  future  use. 
Note  that  at  each  time  t  =  tg  +  nT,  the  reaj,-ti,me^distur^ance-util  izing  con¬ 
trol  Eq.  (21)  depends  on  the  values  of  R,  B,  A,  E,  FH,  ”D,  and  x,  c,  z 
evaluated  at  t  =  tp  +  nT  and  the  values  of  Ky,  ^xz  evaluated  at  the  "one- 

step-ahead"  time  t  =  tp  +  (n+l)T. 

THE  NOTIONS  OF  FIXED  COST,  ASSISTANCE,  BURDEN,  AND  UTILITY  IN  DISTURBANCE¬ 
UTILIZING  CONTROL  PROBLEMS 

The  optimal  disturbance-utij izing  control  u®(nT)  in  Eq.  (21)  achieves  the 
minimum  possible  value  of  J  in  Eq.  (9).  That  minimum  value  of  J  will  be  de¬ 
noted  by  the  scalar  function  V  =  V(x(nT) ,  c(nT),  z(nT),  (n+l)T)  where  x(nT), 
c(nT),  z(nT)  denote  arbitrary  "initial  conditions"  in  (x,  c,  z)-space. 

It  can  be  shown  [8]  that  the  function  V(x,c,z,(n+1  )T)  for  the  disturbance- 
utilization  problem  Eqs.  (9)-(ll)  has  the  explicit  form 


which  can  be  expanded  to  yield,  (Note:  all  K-expressions  in  Eqs.  (23),  (24) 
are  evaluated  at  t  =  (n+l)T,  whereas  x,c,z  are  evaluated  at  t  =  nT). 


Fixed  Cost!'  -Assistance  ^=Burden 


(24) 


The  role  of  the  disturbance  w(t)  in  reducing  the  minimum  possible  value  of  J 
can  now  be  clearly  seen  in  Eq.  (24).  Namely,  the  impact  of  w(t)  on  V  = 
min.  J  is  reflected  in  the  z-related  terms  in  Eq.  (24).  If  the  collection 
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of  terms  labeled  CZ  ("assistance")  is  greater  than  the  "burden"  term  , 

then,  and  only  then,  will  min.  J  be  further  reduced  by  the  action  of  w(t). 
Thus,  following  the  ideas  in  [6]  we  define  the  "utility"  w  of  the  dis¬ 
turbance  w(t)  as 

=As5i54jnce  -Burden  =  l^+C  (25) 

The  condition  ^>0  indicates  that  the  current  behavior  of  w(t)  is  such  that 
it  can  help  in  reducing  min.  J.  On  the  other  hand,  the  condition  U<U 
indicates  that  the  current  behavior  of  w(t)  is  such  that  w(t)  can  only  aggra¬ 
vate  (increase)  the  value  of  min.  J.  The  collection  of  terms  in  Eq.  (24) 
which  do  not  involve  z  is  referred  to  as  the  "fixed-cost"  &  because  that* 
contribution  to  V  =  min.  J.  is  invariant  with  respect  to  the  behavior  of 
disturbances  w(t). 


The  disturbance  utility  function  ^  defined  by  Eq.  (25)  can  be  studied  in 
the  (x,c,z,t)-space  to  identify  the  domains  of  positive  and  negative  utility; 
the  details  are  outlined  in  [6]  and  some  specific  examples,  from  continuous¬ 
time  disturbance-utilizing  DAC  theory,  are  presented  in  [7]  and  [9].  Note 
that  as  time  progresses,  n  =  0,1,2,...,  the  sign  of  U  can  change  back  and 
forth. 


THE  EFFECTIVENESS  &  OF  OPTIMAL  DISTURBANCE-UTILIZING  CONTROL 


The  linear-quadratic  regulator  theory  is  widely  used  to  design  feedback  con¬ 
trol  laws  of  the  form  u(-)  =  1K.(-)  x(-)-  Traditionally,  such  applications 
have  ignored  the  presence  of  persistent  disturbances  w(t)  and  therefore, 
when  confronted  with  actual  real-life  disturbances  in  the  field,  such  con¬ 
trol  laws  do  not  yield  "optimal"  performance.  Thus,  it  is  interesting  to 
study  how  much  better  the  disturbance-utilizing  control  law  performs,  com¬ 
pared  to  the  conventional  linear-quadratic  control  law,  when  the  two  closed- 
loop  systems  are  subjected  to  the  same  typical  realistic  disturbances  w(t). 

To  quantify  such  a  comparison,  Kelly  [7]  has  proposed  the  concept  of  "effect¬ 
iveness"  S  defined  for  the  discrete-time  case  as 


J.. 


100% 


(26) 


where  jin  is  the  value  of  Eq.  (9)  obtained  by  using  the  conventional  (undis¬ 
turbed)  ^discrete-time  linear-quadratic  control  law,  and  Jquq  =  V  is  the 
value  Eq,  (24)  of  Eq.  (9)  obtained  by  using  the  optimal  disturbance-utilizing 
control  law  Eq.  (21)  —  in  both  cases  the  plant  Eq.  (la,b)  is  subjected  to 
the  same  disturbance  w(t)  [as  generated  by  the  assumed  disturbance  model 
Eq.  Thus,  if  the  disturbance-utilizing  controller  Eq.  (21)  is  a  ^ 

better  performer  (as  it  should  be)  one  should  find  that  JpyQ  is  1  ess  than  Jlq 
and  therefore  <S  is  positive.  The  maximum  possiblej/alue  of  o  is  100% 
which  would  correspond  to  the  (unlikely)  case  that  Jquc  ~  Thus,  the 
closer  S  is  to  100%  the  greater  is  the  effectiveness  of  Eq.  (21)  compared  to 
the  conventional  linear-quadratic  control  law. 
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It  is  interesting  to  note  that  the  conventional  linear-quadratic  control 
law  used  in  such  comparisons  can  be  obtained  directly  from  £q.  (21)  by 
simply  setting  2(nT)  =  0,  Vn,  (and  also  c(nT)  =  if  Yci^)  =  0  the 

desired  response).  This  observation  shows  that  the  disturbance-utilizing  con¬ 
trol  law  Eq.  (21)  automatically  reduces  to  the  conventional  linear-quadratic 
control  law  whenever  the  disturbance  w(t)  vanishes.  In  other  words,  the 
matrix  Kx(‘)  Eq.  (21)  and  Eq.  (22a)  is  precisely  the  same  matrix  used  in 
the  conventional  (undisturbed)  linear-quadratic  regulator  control  law. 


EXAMPLE  DESIGN  OF  A  DISCRETE-TIME  DISTURBANCE-UTILIZING  CONTROLLER 


To  demonstrate  application  of  the  controller  design  algorithm  for  discrete¬ 
time  disturbance-utilization,  we  will  consider  a  rather  general  version  of 
a  first-order  plant  with  first-order  disturbance.  The  plant  discrete-time 
model  Eqs.  (la,  b)  and  disturbance  discrete-time  model  Eqs.  (lc,d)  are 
expressed  as 


E  a(nT) = a(nT)x(nT)+b(nT)u(hT)+f  A(nT)?(nT) + 
y(ttTj=C(nTJx(nT) 

E  ?(iiTj = d(iiT)2(nTj +6^(nT) 


(27a) 


(27b) 


(27c) 


where  x  =  scalar,  u  =  scalar,  z  =  scalar.  We  will  assume  that  the  desired 
behavior  is  set-point  regulation  to  x(nT)  =  0  Vn;  therefore  y  =  x  in  Eq. 

(4)  and  7^^=  O^in^Eq.  (5).  Thus,  we  may  set  G(t)  5  0  in  Eq.  (5).  The  para¬ 
meters  a,  'B,  fh,  d* may  be  time-varying. 


The  discrete-time  performance  index  J  in  Eq.  (9)  is  expressed  as 

(N-l)  |>o 

f>o 

R  >0 


J”  u(nTj]  ; 


(28) 


where,  in  this  example,  s',  q,  R  are  arbitrary  positive  scalars  and  e  =  -x. 

The  optimal  disturbance-utilizing  control  Eq.  (21)  for  this  example  has  the 
specific  form  ^ 


unl  =- 


a 


k(ii+i)T)x+[f^  k((n+j)T)+dk|H-!)T)] 


(29) 


R  +  b  l<((n+i)Tj 

X 

where  we  have  set  k^c  =  0  because  G(t)  =  0  (set-point  y^.  is  zero).  The 
time-varying  gains  kj((-),  kxz(-)  associated  with  the  control  Eq.  (29)  are 
computed  from  the  difference  equations  Eqs.  (22a, c)  which  for  this  example 
reduce  to 


/V  a;  2 

R  CL  i>x' 


l<(iiT)= 

^  R  +  b  l<((rH-i)T) 


k,(0iw)T)  ^ 


c  =  l 

k(KD=s  >o 


(30a) 
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;  kW’W  {30b) 
Xi 

One  can  now  compute  the  successive  values  of  kx2  fo*"  “  (N-1),  (N-2), 
(N-3),  2,  1,  0  using  Eq.  (30)  and  the  indicated  "starting"  conditions 

at  t  =t^+  nT.Those  computed  values  are  then  stored  and  used  later  in  the 
real-time  computation  of  Eq.  (29). 


Ra 


k  ((n+i)T)d +k^4-i)T)ffi 

XZ  ^ 


CONSIDERATION  OF  THE  TIME- INVARIANT  CASE  OF  THE  EXAMPLE 

If  one  assumes  that  the  plant  and  disturbance  models  Eq.  (27)  came  from  a 
time-invariant  continuous-time  plant  and  disturbance  model ,  the  preceeding 
results  Eqs.  (29),  (30)  can  be  expressed  in  more  explicit  form.  In  parti¬ 
cular,  if  Eqs.  (27)  are  assumed  to  derive  from  the  continuous-time  models 

(31a) 

(31b) 


X  =ax  +bu+f'w 
x=d?+(r(-t)  ; 


w 


where  a,  b,  f,  h,  d  are  constant  scalars,  then 


(32) 


d  = 


Using  Eq.  (32)  in  Eqs.  (29),  (30)  leads  to  the  following  explicit  expressions 
(shown  for  the  case  a  o,  d  ^  a).  The  control  law  Eq.  (29)  becomes 

Awn?! 


.7,-  1-10— 


and  Eqs.  (30)  for  the  gain  matrices  become 

Re^ly[itn)T) 


— — +5  3  k(NT)’^s 


(34a) 


-l<(iro=(9  (34b) 
Xi 


RESULTS  OF  A  SIMULATION  STUDY  OF  THE  EXAMPLE 

The  time-invariant  case,  Eqs.  (31)  -  (34),  of  the  example  was  studied  by  digital 
simulation  techniques  using  the  specific  parameter  values 
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a  =  +1 ;  b  =  +1 ;  f  =  +1 ;  h  =  +1 ;  d  =  0;  w(0)  =  w(t)  =  +1 
=  0;  =  10;  T  =  1.0  (/.N  =  10);  x(0)  =  +1 


Note  that  the  open-loop  plant  is  unstable,  with  open-loop  pole  at  +1,  and 
the  disturbance  w  is  constant  with  a(t)  =  0.  For^thi^s  study,  the  following 
three  sets  of  values  of  the  weighting  parameters  s,  q,  'k  in  the  performance 
index  Eq.  (28)  were  investigated 

Case  1 :  s  =  10  q  =  1 

Case  2:  s  =  1  q  =  10 

Case  3:  s  =  1  q  =  1 

The  resulting  behavior  of  the  optimal  disturbance-utilizing  control  u^(t) 

and  the  plant  state  x(t)  are  shown  in  Figure  1.  The  optimal  trade-offs  in 
the  performance  of  x(Tjp),  x(t),  and  u(nT),  (corresponding  to  the  relative 
magnitudes  of  the  weignting  parameters  S,  q,  n  in  Cases  1,  2,  3)  are  clearly 
evident  in  Figure  1. 


R  =  1 

R  =  1 
R  =  10 


(36) 
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ABSTRACT 

This  paper  describes  an  application  of  newly-developed  control 
theory  relating  to  the  design  of  discrete  controllers  which  ac¬ 
commodate  gaussian  disturbances  in  addition  to  disturbances  that 
possess  "waveform  structure".  Previous  work  in  this  area  applied 
linear-quadrat ic-gaussian  theory  to  terminal  homing  missile  prob¬ 
lems  with  gaussian  noise  and,  more  recently,  disturbance-utilizing 
control  theory  to  terminal  homing  missile  problems  with  "waveform- 
type"  disturbances.  This  paper  describes  a  new  application  which 
combines  features  of  both  approaches  in  terminal  homing  problems 
where  both  classes  of  disturbances  may  be  present.  The  under¬ 
lying  theory  of  this  combined  approach  is  discussed  and  the  op¬ 
eration  of  the  discrete  controller  in  a  planar  missile  inter¬ 
ceptor  engagement  model  is  illustrated  with  a  numerical  example. 


INTRODUCTION 

Algorithms  for  synthesizing  disturbance-utilizing  controllers  were 
developed  and  applied  by  Johnson  [1,  2,  3]  several  years  ago  to 
analog-type  control  problems  where  the  uncertain  external  influ¬ 
ences  on  the  plant  were  "waveform  type"  disturbances.  Just  a 
year  ago  Johnson  [4]  reported  the  development  and  application  of 
a  design  algorithm  for  synthesizing  discrete-time  disturbance¬ 
utilizing  controllers.  And  now  Johnson  [5]  has  extended  the 
discrete-time  Disturbance-Utilizing  Control  (DUC)  theory  to  in¬ 
clude  plant  and  sensor  disturbances  of  the  gaussian  noise  type 
with  known  mean  and  covariance. 

The  DUC  theory  is  based  on  the  fact  that  disturbances  may  some¬ 
times  produce  effects  which  are  beneficial  to  the  primary  control 
objectives.  For  instance,  certain  forms  of  wind  gusts  may  actu¬ 
ally  help  to  steer  a  missile  toward  a  specific  target.  Maximum 
utilization  of  a  disturbance  having  waveform  structure  can  be 
achieved  by  employing  optimal  control  theory  to  design  the  con¬ 
troller.  The  key  to  obtaining  maximum  utilization  of  distur¬ 
bances  is  to  choose  a  performance  index  J  so  that,  when  J  is 
minimized  with  respect  to  the  control  u{t),  the  primary  control 
objective  is  accomplished  and  maximum  use  of  the  disturbance  is 
achieved. 
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Uncontrolled  inputs  to  control  systems  may  be  classified  as 
either  noise-type  disturbances  or  disturbances  with  "waveform 
structure".  Thermal  noise  in  a  radar  receiver  is  an  example  of 
a  noise-type  disturbance/  while  gravity/  wind  gusts  and  elec¬ 
tronic  instrument  drift  are  examples  of  waveform-type  distur¬ 
bances.  While  noise-type  disturbances  are  characterized  by  their 
statistical  properties  (e.g./  variance  and  mean)/  waveform-type 
disturbances  can  be  modeled  by  determining  a  differential  equa¬ 
tion  that  the  disturbances  are  known  to  satisfy. 

If  the  uncertain  external  influences  on  the  controlled  process 
are  waveform-type  rather  than  noise-type  disturbances/  the  well 
known  stochastic  control  techniques  [6/7]  do  not  result  in  the 
most  effective  controller.  On  the  other  hand/  if  the  uncertain 
external  influences  are  noise-type  rather  than  waveform-type 
disturbances/  then  Johnson's  DUG  theory  is  inadequate;  the  DUG 
controller  does  regulate  the  set  point/  however  it  uses  large 
amounts  of  control  energy  trying  to  utilize  the  random  noise. 

In  some  practical  applications/  the  disturbances  acting  on  the 
plant  consist  of  a  combination  of  waveform-type  disturbances 
and  noise-type  disturbances.  Moreover/  the  sensor  measurements 
are  usually  corrupted  by  additive  noise  which  tends  to  obscure 
the  output  behavior  information.  ThuS/  Johnson  extended  the 
discrete- time  DUG  theory  to  include  those  cases  where  the  plant 
may  be  subject  to  noise-type  disturbances  and  where  the  plant 
output  sampled  measurements  may  be  corrupted  by  additive  noise. 
This  extended  theory  is  referred  to  as  discrete-time  Noisy  DUG 
theory. 

The  discrete-time  Noisy  DUG  theory  is  applied  in  this  paper  to 
the  problem  of  utilizing  disturbances  in  a  homing  missile  gui¬ 
dance  problem.  While  the  conventional  approach  to  handling  dis¬ 
turbance  effects  is  to  attempt  to  eliminate  them/  the  approach 
taken  here  is  to  formulate  the  optimal  controller  that  accounts 
for  the  waveform  properties  of  the  disturbance.  Numerical  re¬ 
sults  are  given  to  show  the  comparison  between  the  performance 
of  the  noisy  disturbance-utilizing  controller  and  a  conventional 
linear-quadratic-gaussian  controller  in  the  presence  of  both 
noise-type  and  waveform-type  disturbances. 


BAGKGROUND 

The  Noisy  DUG  theory  in  [5]  is  developed  for  plants  which  can  be 
modeled  as  linear  dynamical  systems/  with  respect  to  an  appro¬ 
priate  operating  point  or  regime.  Specifically  the  plant  is 
modeled  by: 

X  =  A(t)x  +  B(t)u  +  F^(t)w^  +  Fj^3^(t)Wj^l  (1-a) 

y  =  G(t)x  +  Vj^  (1-b) 
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where  x  =  x^^)  is  the  plant  state  vector;  y  =  {y^., 

is  the  measurement  vector;  u  is  the  control  input;  is  the 
vector  of  waveform- type  disturbances;  w^^^  vector  of  noise- 

type  disturbances  and  v^^  is  the  vector  of  measurement  noise  which 
corrupts  the  data  C(t)x  in  the  output  measurement  y(t). 

The  vector  noise  terms  v^^  in  (1)  are  assumed  to  be  white 

noise  random  processes  with  known  means  and  covariances.  Mathe¬ 
matical  models  for  the  gaussian  random  processes 

'[“nil  =  0,  E[v„]  =  0 

cov[Wj^j^  (t)  , 

cov[Vj^(t),  Vj^(t)]  =  Rj^(t)6(t  -  t) 
cov[Wni(t)  ,  v^('r)]  =  0 

where  e[*]  denotes  the  expectation  (mean  value)  operator,  cov[*,«] 
denotes  the  covariance  operator,  and  Q^j^(t),  are,  respec¬ 

tively,  non-negative  definite  and  positive  definite  symmetric 
matrices. 

The  vector  of  waveform-type  disturbances  w^(t)  in  (1)  is  modeled 
by 


w^  =  H(t)z 


(2-a) 


z  =  D(t)z  +  a{t)  + 


w 


n2 


(2-b) 


where  z  =  (z^^,  ...,  Zp)  is  the  disturbance  state  vector  and  w^2 

denotes  a  vector  white  noise  process  with  known  mean  and  covari¬ 
ance.  The  mathematical  model  for 


G[w 


n2^ 


0 


cov[Wn2(t)r  ~  '  ®n2  -  ® 

cov[Wni(t),  cov[Wj^2(t)'  ° 


The  term  a(t)  denotes  a  sparse  sequence  of  totally  unknown  im¬ 
pulses  (unknown  arrival  times  and  unknown  intensities) . 

The  mathematical  models  (1) ,  (2)  are  continuous -time  models  de¬ 
fined  for  all  time  t.  However,  to  design  a  discrete-time  digital 
controller  it  is  convenient  to  have  discrete-time  versions  of  the 
models  of  (1),  (2)  which  describe  the  behavior  of  x (t)  and  w^(t) 
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at  discrete  points  in  time  t  =  t^,  t^  +  T,  tQ  +  ^T,  where  T  is  the 

sampling  period  associated  with  the  digital  controller.  The  pro¬ 
cedures  for  developing  discrete-time  versions  of  the  mathematical 
models  (1) ,  (2)  are  described  in  detail  in  [8] .  Application  of 
those  procedures  to  (1)  and  (2)  leads  to  the  following  discrete¬ 
time  models 

Ex(nT)  =  A(nT)x(nT)  +  B(nT)u(nT)  +  F^H(nT)z(nT) 

+  F^j^(nT)w^j^(nT)  +  Y(nT)  (3-a) 

Ez(nT)  =  D(nT)z(nT)  +  a(nT)  +  F^2  Wj^2  (3-b) 

y(nT)  =C{nT)x(nT)  +  v^(nT)  (3-c) 

where  the  matrices  A,  B,  etc.  are  related  to  A,  B,  F^,  H, 

^nl'  ^  expressions  given  in  [8]  and  E  is  the  delay 

operator  r  Ex  (nT)  =  x((n+l)T).  The  terms  w^j^CnT),  v^(nT),  Wj^2 

in  (3)  represent  discrete-time  noise  processes  which  are  modeled 
as  sequences  of  zero-mean,  independent  gaussian  random  variables 
with  known  covariances 

cov[Wni{nT),  w^i(jT)]  =  Qjji6  (n  -  j) 

cov[Vj^(nT),  Vj^(jT)]  =  Rn6{n  -  j) 

cov[Wn2(nT),  ^ 

where  >  0,  >  0,  Q^2  >.  ^  may  vary  with  time. 

The  discrete-time  models  are  consolidated  into  one  composite  model 
written  as 

Ex  =  =  A(nT)x(nT)  +  B(nT)u(nT) 


+  F(nT)w^(nT)  +  6(nT) 
y(nT)  =  ^(nT)  +  v^(nT) 

where 


(4-a) 

(4-b) 

(4-c) 

(4-d) 
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Since  the  plant  motions  x(t)  governed  by  (1)  are  random  in  nature 
(due  to  the  presence  of  w^)  the  expected  value  of  the  performance 

index  J  is  to  be  minimized.  Thus,  when  E[J]  is  minimized  with 
respect  to  the  control  u(nT),  the  primary  control  objective  is 
accomplished  and  maximum  use  of  the  disturbance  w,  is  achieved. 

e[J]  is  given  in  [5]  in  terms  of  x  as 


~  ( 1~T  ~~  T  '^“(N-l) 

e[J]  =  e  <  7xMNT)Sx  (NT)  +  4  Z  [x^  (nT)Qx  (nT) 

^  n=0 

+  u'^(nT)Ru(nT)]  I  (5) 

where 

S  =  c'^SC,  Q  =  C^QC 

and  the  time  interval  of  problem  definition  (t^,  T^)  has  been 
divided  into  N  equal  segments  of  length  T.  The  weighting  matrices 
S,  Q,  R  are  symmetric  with  S^O,  Q>0,R>  0,  and  S  +  Q  >  0. 

The  discrete-time  Noisy  DUG  problem  may  be  precisely  described  as 
follows.  Given  the  composite  plant  model  (4)  find  the  control 
u  (nT)  which  minimizes  (5)  for  all  independent  random  initial 
conditions  {x (t^) ,  z(tQ)}  satisfying 

mean  x  (t^)  =  ^ (t^)  j  cov[x(tQ)]  =  R^^ 

mean  z(t^)  =  ^(t^);  cov[z(tQ)]  =  R^^ 

where  x,  z,  R^^  and  R^^  are  given.  As  is  customary  in  the  DUG 

theory,  the  sparse  terms  in  s'  in  (4)  are  neglected  in  deriving 
the  control  since  they  are  completely  unknown.  This  problem  is 
of  the  form  known  as  the  "discrete-time  linear-quadratic-gaussian 
optimal  control  problem"  which  has  been  extensively  studied  in 
the  literature  and  its  solution  is  described  in  [9] .  When  that 
known  solution  is  applied,  the  optimal,  discrete-time  disturbance¬ 
utilizing  control  u°(nT)  is  given  by 

u°(nT)  =  -[R(nT)  +  B'^(nT)p((n 

+  l)T)B(nT)]  “^[B^(nT)P  ((n 

+  1)  T)A(nT)]  X  (nT)  (6-a) 

—  — .T 

where  P  =  P  obeys 
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(6-b) 


P(nT)  =  A'^(nT)  P  ((n+l)T)A(nT)  +Q 

-  [f'^CnT)?  ({n+l)T)A(nT)]  "^[r 

+  B'^(nT)P  ((n+l)T)B(nT)]  "^[b’^CiiT)?  ((n 

+l)T)A(nT)]  ;  P(NT)  =  S 

where  x  (nT)  denotes  the  estimate  of  x (nT)  obtained  from  a  discrete¬ 
time  Kalman  filter.  The  essential  difference  between  the  noisy  and 

non-noisy  solutions  for  the  optimal  control  is  in  the  term  x (nT) . 

In  the  non-noisy  case,  x  is  generated  by  an  observer  with  rather 

arbitrary  observer  gains.  In  the  noisy  case,  x (nT)  is  generated  by 
a  precisely  described  Kalman  filter  with  specific  time-varying 
gains.  The  Kalman  filter  is  given  by 

Ex  =  A(nT)x(nT)  +  B(nT)u(nT)  +  K^ (nT)  [^y  (nT) 

-C(nT)x(nT)] 

/V  — 

x(o)  =  x(tQ)  =  (x  (t^)  I  i  (t^))  =  x(t^)  (7) 


The  Kalman  gain  matrix  K^  is  specified  by 

Kg(nT)  =  A(nT)Pj(nT)c'^(nT)  [c(nT)Pj(nT)c'^(nT) 

+  R^(nT)]"^  (8-a) 

where  the  symmetric  matrix  P^  is  governed  by  the  discrete-time 
Riccati  matrix  equation 

Pg((n+1)T)  =  A(nT)P£(nT)  A'^(nT) 

-  A(nT)Pg  (nT)c'^(nT)  [c(nT)Pj  (nT)c'^(nT) 

+  Rj^(nT)]  ■^C(nT)Pg(nT)A^(nT) 

+  F(nT)Q^(nT)  F'^(nT)  (8-b) 

with  initial  condition 


Pg(o)  =  cov[x  (t^)] 

and  where 


r\o 

0 

0 

R 

zo  J 

(8-c) 
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(8-d) 


®nl 

0 

0 

®n2 

It  is  useful  to  examine  the  detailed  structure  of  the  individual 
equations  associated  with  expressions  (6)  (7)  (8) .  Set 


r  K 

K  -| 

X 

xz 

„T 

K  „ 

K 

L  X  z 

z  J 

Substitution  of  (9) ,  (4-c)  and  (4-d)  into  (6-a)  leads  to  the  fol¬ 
lowing 

u°(nT)  =  -[R(nT)  +  B'^(nT)Kjj  ((n 

+l)T)B(nT)]  "^B'^(nT)[K^  ((n 
+l)T)A(nT)x  (nT)  +  (K^((n 

+l)T)E^(nT)  +  ((n+l)T)D(nT))  z(nT)]  (10) 

A  set  of  three  coupled  metric  difference  equations  which  govern 
the  matrices  (nT)  ,  (nT)  is  obtained  by  substituting 

(4-c) ,  (4-d)  and  (9)  into  (6-b) .  That  set  of  equations,  which  is 
lengthy,  is  documented  in  [4]  and  therefore  will  not  be  repeated 

here.  Note  that  if  z (nT)  in  (10)  were  set  to  zero  the  optimal 
disturbance-utilizing-control  would  become  the  conventional  LQG 
control. 


Detailed  expressions  for  the  Kalman  filter  are  obtained  by  sub¬ 
stituting  (4-c)  and  (4-d)  into  (7)  to  obtain 


where 


Ex  =  Ax  +  F^Hz  +  Bu  +  Kg^  (y 

-  Cx)  ;  X  (t^)  =  X  (to) 

(11-a) 

Ez  =  Dz  +  -  Cx);  Mt^) 

=  2(V 

(11-b) 

K 

L^'fzJ 

(12) 

Detailed  expressions  for  the  Kalman  gain  matrices  in  (11)  are 
obtained  by  substituting  (4-c),  (4-d),  (12)  and  the  expression 


s: 

r^xx 

Pzx'^l 

^  ^zx 

^zz  _ 
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into  (8-a)  to  obtain 


hx  =  l^xx  +  V''zx>c''t«xx=''/  <“-a) 

^fz  =  “zx^’^l^xx':''  +  <“-b) 

In  a  like  manner^  the_set  of  matric  difference  equations  governing 
the  blocks  P,,  are  obtained  from  (8-b)  as 

XA  ZA  ZZ 

®^xx  =  <“xx  +  V^zx>*’'  *  <“L  *  ^PzxKV’’’ 

-  [il 

^  j'nlQnl^nl’''  ^xx ’  Sxo 
*^^zx  =  BI^x*’’  ^  Pzz(V>''> 

-  [61  [CP^xC’'  +  Rjjl'^ral  ;  Pjxfo)  =0  (14-b) 

®^zz  -  “zz°’'  -  fBHCP^xC’'  +  R„l-^[6l’' 

*  ^n20n2fn2’''  5zz(°>  =  «zo  <“-<=> 

where 

[il  = 

#w  fn 

(6)  =  DP^^C’^ 

The  matric  equations  (14)  can  be  integrated  in  forward  time  since 
the  initial  values  P^^ (o) ,  P,^ (o) ,  P^,(o)  are  explicitly  known.  On 

AA  ZA  ZZ 

the  other  hand,  the  matrices  K  (nT) ,  K„_(nT),  K_ (nT)  must  be 

X  X  z  z 

obtained  by  backward-time  integration  of  (6-b)  since  only  the 
terminal  values  (t  =  NT)  are  known. 


APPLICATION  TO  HOMING  MISSILE  GUIDANCE 
MATHEMATICAL  MODEL 

As  an  illustration  of  the  Noisy  DUC  theory  the  controller  design 
algorithms  developed  by  Johnson  [5]  are  applied  to  a  homing  inter¬ 
cept  problem.  The  missile  is  to  be  controlled  during  the  final 
phase  of  its  flight  so  that  its  position  coincides  with  that  of  a 
target  at  a  specified  terminal  time,  even  in  the  face  of  distur¬ 
bances  which  may,  or  may  not,  be  detrimental  to  the  control 
objective . 
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The  planar  geometry  for  this  problem  is  shown  in  Figure  1,  where 
the  origin  is  located  at  the  moving  target  position  and  the  posi¬ 
tion  of  the  missile  is  defined  by  the  coordinates  (x^^,  y^^) , 
where  is  horizontal  and  is  vertical. 


It  is  convenient  to  consider  a  reference  line-of-sight  (REF  LOS) 
passing  through  the  target  and  oriented  at  a  known  angle  rela¬ 
tive  to  the  horizontal  line  The  REF  LOS  is  established  a 

priori y  and  may  correspond  to  a  desired  orientation  of  the  line-of- 
sight.  A  coordinate  Xj^  is  established  normal  to  the  REF  LOS  (Fig¬ 
ure  1)  and  it  is  assumed  that  the  missile  begins  the  homing  phase 
of  the  problem  with  a  certain  displacement  x-,  (o)  and  velocity 

X2(o)  (where  X2  =  Xj^)  normal  to  the  REF  LOS.  It  is  assumed  that  a 

previous  "midcourse"  guidance  phase  has  delivered  the  missile  to 
the  beginning  of  the  homing  phase  at  t  =  t^;  thus,  non-zero  values 
of  Xj^  (o)  and  X2(o)  characterize  the  extent  to  which  the  midcourse 

phase  has  failed  to  enable  the  missile  to  start  the  homing  phase 
under  ideal  conditions.  The  initial  range  to  the  target  and  the 
closing  velocity  are  assumed  given.  The  problem  at  hand  uses  the 
"small  LOS"  assumptions  as  in  [10]  and  considers  that  the  distur¬ 
bance  forces  of  primary  interest  are  those  acting  normal  to  REF 
LOS.  Errors  in  estimating  time-to-go  to  intercept  are  not  con¬ 
sidered  here. 

The  equations  describing  the  motion  of  the  missile  normal  to  the 
REF  LOS  are 


+ 


w 


nl 


(t) 
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X2  =  u  +  w^(t)  +  w^j^(t) 

Y  =  Xi  +  Vj^(t) 

These  equations  may  be  written  in  the  form 
X  =  +  Bu  +  +  F„^w„^ 

y  =  Cx  + 

where 


C  =  [1  0]  (15-c) 

It  is  assumed  in  the  example  to  follow  that  the  waveform-type 
disturbance  consists  of  a  linear  combination  of  constant  sequences 
and  linear  ramps: 

Wd(t)  =  +  C2t 

where  and  C2  are  unknown  constants.  The  disturbance  process  is 
written  in  state-variable  form  as 

^1  =  "d 

Z2  =  0  +  02  (t)  + 
or  in  the  form 

z  =  Dz  +  a  (t)  +  F^2''n2 
Wd  =  HZ 

where 

°  =  [“  0]' 

and  a(t)  =  [a^,  02]  a  sparse  vector- impulse  sequence  occurring 
at  unknown  instants. 


(16-a) 

(16-b) 


(15-a) 

(15-b) 
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The  corresponding  discrete-time  models  are 

Ex(nT)  =  ^(nT)  +  iu(nT)  +  F^Hz  (nT)  +  (nT) 


where 


+  j{nT) 


y(nT)  =  Cx  (nT)  +  v^(nT) 


Ez{nT)  =  Dz(nT)  +  a(nT)  + 
Wjj(nT)  =  H{nT)z(nT) 


A  = 


"1  T" 

r  2 

T^/2 

T^/6 

_o  i_ 

_T 

T^/2_ 

T  +  T^/2 


nl 


n2 


/t  +  T^/2\  ^ 

ri  Tl 

I  ;  D  = 

' 

\  T  / 

o 

The  waveform-type  disturbances  considered  in  this  problem  are 
gravity,  wind  and  target  maneuver  [10].  The  gravity  component 
acting  normal  to  the  REF  LOS  is  —32.2  cos  The  acceleration 

disturbance  due  to  wind  is  modeled  by  the  acceleration  waveform 
of  Figure  2,  acting  in  a  direction  normal  to  the  REF  LOS.  The 
target  maneuver  acting  normal  to  the  REF  LOS  has  the  waveform 
described  in  Figure  3  with  a  peak  acceleration  value  of  128  ft/sec^. 


Figure  2.  Actual  wind  disturbance  (VV’INDM)  input. 
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Figure  3.  Actual  maneuver  disturbance  (TMAN)  input. 


The  gaussian  noise  disturbances  considered  in  this  problem  are 
v^ni,  '^n  (16).  They  are  assumed  to  have  zero  means 

and  known  covariances  Qn2'  ^n'  respectively. 

CONTROL  OBJECTIVE 

The  primary  control  objective  is  to  drive  the  displacement  of  the 
missile  (normal  to  the  REF  LOS)  to  zero  at  a  specified  terminal 
time  T^;  that  is,  to  regulate  the  state  to  zero  at  t  =  T^. 

The  value  of  at  t  =  T^  is  defined  as  miss  distance.  The  sec¬ 
ondary  objective  is  to  achieve  the  primary  objective  while  effec¬ 
tively  utilizing  the  "free"  energy  of  the  disturbances.  The 
control  objectives  are  to  be  achieved  by  minimizing  the  expected 
value  of  the  performance  index 

n—N”! 

e(J]  =  e(4x^(nT)Sx  (nT)  +  T/2  z  [x^(nT)Qx  (nT) 

(2  n=0 

+  u^(nT) Ru (nT) ]| 

subject  to  the  plant  equations  (15)  and  the  waveform  disturbance 
process  equations  (16) .  The  resulting  optimal  control  u  (nT)  is 
given  by  (10) .  The  control  energy  consumption  (CEC)  is  computed 
for  analyses  as  follows: 

n=N-l 

CEC  =  T/2  Z  [u'^(nT)  Ru  (nT)  ] 
n=0 
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DISCUSSION  OF  RESULTS 

The  homing  intercept  problem  is  solved  by  applying  the  Noisy 

DUG  theory.  First  the  composite  state  vector  x,  the  composite 
system  (4) ,  and  the  expected  value  of  the  performance  index  (5) 
are  established.  Then  the  optimal  control  is  computed  by  (10) 
after  the  gains  K  (nT) ,  K„„(nT)  are  computed.  The  problem  is 

solved  on  a  CDC-6600  computer,  using  backward-time  integration 
to  find  the  initial  conditions  for  K„ ,  K„_,  and  K_,  although 

closed-form  algorithms  would  typically  be  used  to  compute  the 
gains  and  in  applications.  Estimates  of  the  states,  x 

and  z  are  obtained  from  the  discrete-time  Kalman  filter  (11) . 

A  Monte  Carlo  approach  was  developed  to  generate  the  expected 
value  of  the  performance  index.  This  approach  consists  of 
executing  a  specified  number  of  runs  where  normally-distributed 
random  variables  are  generated  each  run  for  the  initial  conditions 
of  the  plant  state  and  the  REF  LOS  orientation  angle.  The  re¬ 
sulting  miss  distance,  performance  index  and  control  energy  con¬ 
sumption  from  each  run  are  used  in  the  computation  of  means  and 
standard  deviations. 

A  sensitivity  study  was  conducted  using  both  deterministic  and 

stochastic  simulations  to  determine  "best”  values  for  S  and  Q„2* 
The  following  parameters  were  fixed  for  the  study; 


x^(o)  =  300  ft; 

¥j^(o)  =  -32.2  ft/sec^; 


X2(o)  =0.0  ft/sec 
¥2(0)  =  0*0  ft/sec^ 
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T  =  0.05  sec;  =  30  deg 

Integration  stepsize  =  0.01;  T^  =  4.5  sec.  The  missile  closing 

velocity  is  2000  ft/sec  and  at  t  ,  the  missile  distance  from  the 
target  is  9000  ft. 
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For  the  first  phase  of  the  study,  only  deterministic  runs  were 
made  to  determine  sensitivities.  In  this  analysis  of  performance 

variation  with  S,  all  the  waveform-type  disturbances  were  simulated 
and  the  disturbance  states  were  corrupted  by  the  noise  {w  j}  which 
had  the  characteristics  " 

e[Wn2l  =  0:  oov[w^2>  =  2n2  “  ^ 

The  variances  of  the  simulated  noise  terms  ,  {Wj^2} 

{v^},  computed  each  run,  were  always  nearly  1. 

The  results  of  varying  S  are  plotted  in  Figure  4  for  both  the 
conventional  linear-quadratic-gaussian  (LQG)  controller^and  the  noisy 

disturbance-utilizing  controller  (NDUC)  .  Since  c'^SC  = 

will  be  referred  to  as  S. 


NOISE  COVARIANCES 


SIMULATED  WAVEFORM  - 
TYPE  DISTURBANCES 
WIND  GUST 
GRAVITY 

TARGET  MANEUVER 


Figure  4.  Performance  as  a  function  of  terminal  weighting  S. 
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As  S  increases  the  miss  distance  approaches  a  bias  value  rather 
than  zero  (later  shown  to  be  due  to  target  maneuver) .  Figure  5 
shows  the  missile  altitude  versus  ground  range  for  the  LQG  with 

S  set  to  100.  The  missile  hits  in  front  of  the  target  for  all 

values  of  S  less  than  100,000  for  the  LQG  and  for  all  values  of 

S  less  than  5000  for  the  NDUC. 


Figure  5.  Missile  trajectory  with  terminal  weighting  S=100. 


An  S  value  which  results  in  an  acceptable  miss  distance  can  be 
picked  for  the  LQG  controller,  however,  the  control  energy  con¬ 
sumption  for  the  NDUC  with  the  same  S  is  much  lower. 

The  waveform-type  disturbance  states  (continuous)  and  estimates 
(sampled)  are  plotted  on  Figure  6.  Estimates  of  the  wind  and  the 
target  maneuver  lag  the  actual  disturbances  because  the  weighting 
Qn2  the  new  values  input  to  the  Kalman  filter  is  low.  However 

Qn2  te  increased  to  improve  the  estimate  of  the  waveform-type 

disturbance.  A  value  of  100  was  chosen  for  S  and  Q  ..  was  varied. 

11  ^ 

The  results,  plotted  on  Figure  7,  show  that  the  minimum  performance 
index  is  obtained  for  a  value  of  Q^2  equal  to  10,000.  The  lag  in 

the  target  maneuver  estimate  is  reduced  as  Q^2  increased  (see 
Figure  8).  However,  the  increased  Q^2  weighting  causes  the  wave¬ 
form  estimate  to  fluctuate  more  which  increases  the  control  energy 
consumption.  The  performance  improvement  due  to  "better"  estimates 
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is  eventually  offset  by  the  control  energy  consumption  so  that  a 
minimum  performance  index  is  obtained.-'  It  should  be  noted  that 
the  actual  noise  term  {w^2}  is  simulated  as  random  white  noise 

with  a  measured  variance  of  approximately  1.0. 


Figure  6.  Disturbance  estimate  with  S=100  and  Q^2  =  i  low). 


With  the  value  of  Q^2  set  at  10,000,  variation  of  S  is  reexamined 

and  the  results  are  presented  in  Figure  9.  Minimum  performance 

index  occurs  for  an  S  value  of  100.  Further  investigation  of  S 
variation  was  conducted  using  a  Monte  Carlo  simulation. 

The  Monte  Carlo  error  sources  were  taken  to  be  the  initial  con¬ 
ditions  for  the  plant  states  and  the  REF  LOS  orientation  angle. 
Normally  distributed  error  source  values  are  picked  from  the  com¬ 
puter  system  library  random  number  generator  based  on  the  specif i 
cations  given  in  Table  1. 


TABLE  1.  MONTE  CARLO  PARAMETERS  AND  VALUES 


VARIABLE 

MEAN 

STANDARD  DEVIATION 

X^(o) 

300.0 

15.0 

X2(o) 

o 

• 

o 

2.0 

-"h 

30.0 

5.0 
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Figure  7.  Noisy  DUG  performance  versus  choice  of  Q  „  (S=100) 
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Figure  8.  Effect  of  choice  of  disturbance  estimate  (S=100). 


The  controller  performance  as  a  function  of  S  for  Monte  Carlo 
25-run  sets  is  shown  in  Figure  10.  Although  the  "best"  mean 

values  for  miss  distance  and  performance  index  occur  for  S  equal 

to  100,  the  miss  distance  approaches  an  offset  value  when  S  is 
equal  to  200  or  250. 

Selecting  S  as  250  and  Q^2  10,000  the  disturbances  were  adjusted 

as  shown  in  Table  2  to  determine  if  the  miss  distance  offset  is 
caused  by  the  noise  or  one  or  more  of  the  waveform-type  distur¬ 
bances.  As  magnitude  of  the  target  maneuver  increases  from  0  to 

128  ft^/sec,  the  miss  distance  increases  indicating  that  the  tar¬ 
get  maneuver  causes  the  offset.  Note  that  the  no  gravity  case  has 
the  highest  values  which  clearly  indicates  the  utilization  of 
gravity  to  reduce  control  energy  requirements. 

Further  Monte  Carlo  analysis  of  the  miss  distance  offset  as  a 

function  of  S  was  performed  for  the  no-target-maneuver  case.  The 
results  (see  Figure  11)  indicate  that  250  is  an  appropriate  value 

for  S.  The  data  (shown  on  Figure  10)  generated  for  the  noise  and 
waveform-type  disturbances  also  indicates  that  250  is  a  "best" 

value  for  S. 
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A  final  comparison  is  made  between  the  LQG  and  NDUC  controllers 
for  various  cases.  The  25-run  set  Monte  Carlo  results  are  listed 
in  Table  3.  This  table  shows  the  effects  of  deleting  individual 
disturbance  inputs.  In  general,  the  NDUC  achieves  lower  misses 
when  waveform-type  disturbances  (or  waveform-type  disturbances 
plus  gaussian  noise)  are  present. 

TABLE  2.  NOISY  DUC  MISS  DISTANCE  OFFSET  FOR  VARIOUS  DISTURBANCE 
CONDITIONS 


CONDITION 

PERFORMANCE 

INDEX 

MEAN/STANDARD 

DEVIATION 

MISS 

distance 

MEAN/STANDARD 

DEVIATION 

CONTROL  ENERGY 
CONSUMPTION 
MEAN/STANDARD 
DEVIATION 

NO  GRAVITY 

31854/1884 

6. 7/1.1 

26092/1471 

NO  WIND 

28950/121 

6. 4/0.1 

23825/200 

NO  TARGET 

MANEUVER 

2323/512 

-0. 3/1.1 

2178/528 

QUARTER  OF  NORMAL 
TARGET  MANEUVER 

4287/655 

1. 3/1.1 

3942/686 

HALF  OF  NORMAL 

TARGET  MANEUVER 

9843/963 

2.9/1. 1 

8685/929 

ALL  NOISE  AND 
WAVEFORM  DISTUR¬ 
BANCES 

31732/1710 

6. 0/1.1 

27106/1500 

NO  NOISE 

31025/103 

6. 0/0.1 

26477/178 

NO  WAVEFORM- 
TYPE  DISTUR¬ 
BANCE 

3119/160 

0.9/0. 1 

3028/151 

TABLE  3.  PERFORMANCE  OF  NOISY  NDUC  VERSUS  LQG  FOR  VARIOUS 
DISTURBANCE  CONDITIONS 


CONDITION 

PERFORMANCE 

INDEX 

MEAN/STANDARD 

DEVIATION 

MISS 

DISTANCE 

MEAN/STANDARD 

DEVIATION 

CONTROL 

ENERGY 

MEAN/STANDARD 

DEVIATION 

LQG 

NDUC 

LQG 

NDUC 

LQG 

NDUC 

NO  NOISE 

430841/10258 

31025/103 

-55.8/0.7 

6. 0/0.1 

41307/1090 

26477/178 

NO  GRAVITY 

261412/413 

31854/1884 

-43.5/0.1 

6. 7/1.1 

24396/26 

26092/1471 

NO  WIND 

440596/10397 

28950/121 

-56.4/0.7 

6. 4/0.1 

42797/1132 

23825/200 

NO  TARGET 

MANEUVER 

15729/1881 

2323/512 

-10.5/0.7 

-0. 3/1.1 

1997/203 

2178/528 

NO  WAVEFORM- 
TYPE  DISTURBANCE* 

1691/161 

1691/161 

0. 9/0.1 

0. 9/0.1 

1583/151 

1581/151 

*THE  INITIAL  CONDITION  ON  Zi r  WHICH  IS  USED  IN  THE  DUC  CONTROL  LAW,  IS  SET  TO  ZERO  FOR  THIS 
CASE  ONLY. 
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Figure  11.  Monte  Carlo  performance  bounds  for  noisy  DUG  non-maneuver  case. 


CONCLUSIONS 

The  results  of  this  investigation  show  that  the  Noisy  DUC  controller 
provides  improved  performance/  when  compared  with  an  LQG  controller/ 
in  those  cases  in  which  both  "waveform-type”  disturbances  and 
gaussian  noise  are  present.  Although  the  problem  considered  here 
is  relatively  simple/  it  demonstrates  the  potential  of  the  Noisy 
DUC  approach.  In  a  following  investigation  this  technique  will  be 
used  to  design  a  control  law  which  will  be  implemented  in  a  six 
degree-of-f reedom  air  defense  simulation. 
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ABSTRACT 


This  paper  presents  a  generic  study  showing  stability  control  of 
a  large  inertia,  dynamical,  nonlinear  system  such  as  a  power 
system.  The  techniques  of  this  study  could  be  applied  to  a 
number  of  applications  including  both  Slewing  and  Position  of 
Fire  Control  Systems,  and  the  stability  of  electrical  generators 
in  the  presence  of  disturbances. 

The  stabilizing  control  is  obtained  by  a  generating,  a  priori,  a 
sequence  of  stabilizing  trajectories,  which  converges  to  a  de¬ 
sired  or  "best"  stabilizing  trajectory.  The  initial  control  is 
chosen  as  an  optimal  one  for  the  approximate  linear  case,  and 
then  an  iterative  procedure  yields  a  sequence  of  trajectories  and 
control  schemes  for  the  nonlinear  system.  The  final  controller 
is  shown  to  be  robust,  and  it  stabilizes  certain  disturbances 
that  in  the  uncontrolled  case  lead  to  unstable  system  behavior. 

An  example  of  stabilizing  a  faulted,  1000  MW,  generator  is  shown. 
Unlike  prior  studies,  a  nonlinear  model  of  the  generator  includ¬ 
ing  magnetic  saturation  was  used,  and  the  results  showed  enhanced 
transient  stability  over  previous  techniques. 
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INTRODUCTION 


Presently,  large  megawatt  synchronous  generators  are  disconnected 
from  a  power  system  when  a  major  fault  is  detected  at  its  termi¬ 
nals,  and  it  could  take  several  hours  to  reconnect  the  generator 
to  the  system.  However,  if  the  generator  was  disconnected  from 
the  system  and  the  fault  was  allowed  to  clear,  then  the  generator 
would  accelerate  for  the  time-period  it  has  no  load.  When,  after 
clearing  the  fault,  the  generator  is  reconnected  to  the  system, 
the  question  now  is  whether  the  generator  remains  in  synchronism 
or  not.  Posed  in  terms  of  control  theory  language,  the  question 
is  whether  or  not  a  stable  equilibrium  state  can  be  reached  after 
the  occurrence  of  a  fault? 

This  paper  presents  the  results  of  a  generic  study  [ 1 ]  using 
statef eedback  in  a  suboptimal  scheme  for  the  stability  control  of 
a  synchronous  generator.  The  control  action  proposed  from  this 
study  can  be  determined  prior  to  a  fault  and  is  shown  to 
stabilize  disturbances  that  in  the  uncontrolled  case  lead  to 
instability.  Since  the  synchronous  generator  is  a  large  inertia 
dynamical,  nonlinear  system  it  is  felt  that  results  of  this  study 
can  be  applied  to  similar  problems  in  slewing  and  position  of 
fire  control  systems. 

PROBLEM  DESCRIPTION 

It  is  well  recognized  that  the  optimal  control  of  nonlinear 
systems  will  often  have  to  be  obtained  from  an  iterative  scheme 
[2,3].  The  closed-form  analytical  solution,  while  most 
desirable,  is  not  known  for  nonlinear  systems  except  for  a  few 
special  cases.  The  transient  stability  problem  of  a  power  system 
is  a  nonlinear  problem  which  is  concerned  with  the  development  of 
a  stabilizing  control  law  for  a  large  inertia,  dynamical  system 
where  the  system  is  subjected  to  a  major  disturbance  and  must  be 
returned  to  an  equilibrium  state,  the  exact  nature  of  which  is 
not  known  a  priori.  Thus  the  structure  of  the  problem  is  of  the 
form:  Given  the  dynamical  system 

X  =  ( 1 ) 

determine  a  control  ^*,  that  transfers  the  system  from  the 
present  state  to  an  equilibrium  state  in  a  finite  time  Interval, 
while  minimizing  a  suitable  performance  index. 

To  solve  this  stabilization  problem  some  researchers  have  devised 
'optimal-aiming*  strategies  that  attempt  to  aim  the  system  at 
each  point  of  its  trajectory  to  a  stable  solution  [4,5,6].  The 
trouble  with  such  'pointwlse'  control  strategies  is  that  stabili¬ 
zation  of  the  nonlinear  system  is  not  guaranteed  and  in  fact 
there  exist  controllable  linear  systems  which  will  be  destabil¬ 
ized  by  such  strategies  [?]!  Clearly  such  a  strategy  is  a  poor 
choice  for  the  problem  at  hand.  Other  researchers  [8,9,10]  have 
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suggested  open-loop  schemes  for  the  transient  stability  control 
of  the  synchronous  generator.  But  open-loop  schemes  do  not  take 
into  account  any  changes  in  the  system  that  could  occur  in  the 
post-disturbance  period.  Rather  the  attempt  is  to  return  the 
system  to  exactly  its  pre-fault  status,  irrespective  of  whether 
this  is  an  equilibrium  state  or  not  for  the  post-fault  period. 

Such  strategies  can,  in  some  circumstances ,  push  the  system  into 
unstable  regions  and  not  aid  transient  stability  at  all.  A  feed¬ 
back  control  strategy  that  continually  obtains  information  on  the 
states  avoids  this  pitfall.  In  this  study  the  control  is  applied 
as  a  state-dependent  feedback  control  that  attempts  to  force  the 
system  to  a  closer  target  trajectory  leading  to  an  equilibrium 
state.  The  problem  is  essentially  the  tracking  problem  described 
by  Athans  and  Falb  [11]  except  that  the  system  is  nonlinear  and_ 
that  a  desired  target  set  is  not  known  exactly  but  must  be  speci¬ 
fied  by  exercising  engineering  judgement. 

In  the  study,  the  model  of  the  synchronous  generator  used  is  not 
the  conventional  d-q-o  axis  model,  but  a  direct-phase  reference 
model.  The  details  of  the  direct-phase  model  are  given  in  [12]. 

It  will  suffice  to  state  here  that  compared  to  the  conventional 
model,  this  model  gives  system  states  that  are  conceptually  more 
meaningful  and  physically  measurable.  After  rearranging 
Kirchhoff's  mesh  equation  and  Newton’s  equation  of  motion  for 
rotating  bodies  the  nonlinear  state  vector  equation  is  obtained  as 

X(t)  =  A(X,t)X(t)  +  B(X,t)U(t)  +  V(t)  (2) 

where 

X  is  the  state  vector  composed  of  the  rotor-angle  and 
~  currents  of  the  generator, 

U  is  the  control  vector  consisting  of  the  excitation 
~  voltage,  and  prime-mover  torque, 
and  V  is  the  vector  that  accounts  for  bus-voltage 

effects. 

It  is  important  to  note  that  the  matrices  A  and  B  have  elements 
that  are  not  only  time-varying  but  also  non-linear  in  the  state 
X.  Further,  it  should  be  noted  that  since  the  system  trajectory 
covers  a  wide  range  of  states,  linearization  about  a  nominal 
steady-state  will  not  yield  meaningful  results.  For  example,  it 
makes  little  sense  to  linearize  the  synchronous _ generator 
parameters  at  a  nominal  excitation  when  excitation  voltage  is 
chosen  as  one  of  the  control  variables  and  is  to  be  varied  over 
its  full  range. 

SUBOPTIMAL  SYNTHESIS  OF  THE  CONTROLLER 

Stability  of  the  synchronous  generator  is  most  easily  depicted  as 
the  rotor  angle  behavior  following  a  system  fault.  Figure  1  is 
an  example  of  both  stable  and  unstable  behavior.  In  the  unstable 
case,  the  rotor  angle  grows  large  without  bound,  while  in  the 
stable  case,  the  response  following  a  disturbance  decays.  The 
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The  aim  of  a  controller  would  be  to  make  a  system  that  has  the 
unaugmented  response  of  the  unstable  case  to  follow  the  response 
of  that  in  the  stable  case,  namely  damped  oscillations.  Thus  a 
damped  oscillation  of  the  rotor  angle  in  the  post  disturbance 
interval  is  to  be  chosen  as  part  of  the  target  trajectory. 
Following  the  outline  for  the  tracking  problem  [11],  a  suitable 
quadratic  performance  index  in  terms  of  the  error  from  the  target 
set  is  formulated  as 

J  =  <^(tf),  £ 


+  %  <e(t) ,Q(t)e(t)>+<U(t) ,R(t)U(t)>dt  (3) 

where  ^ 

<,>  =  an  inner  product, 

[tQ,tf]  =  the  time  interval  for  control  action, 

F  =  a  symmetric  weighting  matrix, 

£(t)  =  a  semipositive  definite  weighting  matrix, 

R(t)  =  a  positive  definite  weighting  matrix, 
e(t)  =  [Z(t)  -  Y(t)] 

=  [^(t)  -  £(t)X(t)]  is  the  error, 
with  ^(t)  a  target  vector  and  Y(t)  =  £(t))((t)  the 
observation  vector. 

The  optimal  control  for  the  linear  control  problem  is  known  to 
be 

U*(t)  =  R-1(t)  BT(t)  [G(t)  -  K(t)  X(t)]  and  (4) 
where  ^(t)  is  the  solution  of  the  matrix  differential  equaton 
K(t)  =  -  K(t)A(t)  -  A'’^(t)K(t) 

+  K(t)B(t)R“'' (t)BT(t)K(t) 

-  cT(t)Q(t)C(t)  ,  (5) 

with  boundary  condition 

K(tf)  =  C^T(tj.)  F  C(tf)  (6) 

the  vector  £(t)  is  the  solution  of  the  linear  vector  differential 
equation 

G(t)  =  K(t)B(t)R-'' (t)  -  AT(t)  G(t) 

+  K(t)V(t)  -  c'r(t)Q(t)Z(t)  ,  (7) 

with  boundary  condition 

G(tf)  =  cT(tf)  F  Z(tf) 
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Since  the  backward-time  solutions  of  Equations  (5)  and  (7) 
require  knowledge  of  the  system  trajectory  a  'seed'  case  must 
first  be  obtained.  The  procedure  for  starting  the  iterative 
scheme  is  then  to  obtain  a  system  trajectory  with  no  applied 
control.  This  is  shown  in  Figure  2  where  the  system  was  faulted 
for  six  cycles  before  the  fault  was  cleared.  With  this  6  cycle 
fault,  the  system  is  stable  in  the  sense  that  rotor  swings  do  not 
increase  in  amplitude.  That  the  swings  do  not  decay  either  is 
due  to  the  fact  that  sufficient  damping  was  not  included  in  the 
synchronous  generator  model,  making  the  results  of  the  study 
somewhat  conservative.  Whereas  in  the  practical  system,  there  is 
viscous  damping  of  the  rotor  that  would  enhance  decay  and 
therefore  the  system  would  stabilize  faster.  The  'seed* 
trajectory,  X°U),  is  used  to  obtain  solutions  of  Equations  (5) 
and  (7)  namely  K''(t)  and  G''(t).  These  matrices  can  now  be 
used  to  construct  the  control  of  Equation  (4),  ^'(t).  Since  the 
system  is  nonlinear  U^Ct)  may,  in  general,  not  be  the  optimal 
control,  and  another  trajectory  X^t)  is  obtained.  Equations 
(5)  and  (7)  are  solved  once  again  to  obtain  solutions  K"^(t)  and 
£^(t),  and  a  control  U^(t)  can  now  be  constructed.  The 
Tterative  scheme  is  obvious  and  a  sequence  of  controllers, 

U^'Ct),  is  constructed  until  a  satisfactory  system  trajectory 
T*^(t)  is  obtained. 

Figure  3  shows  that  a  satisfactory  trajectory  is  obtained  in  just 
two  iterations,  while  Figure  4  shows  the  associated  control. 

Note  that  a  disturbance  of  15  cycle  duration  has  been  stabilized 
in  Figure  3.  This  disturbance  destabilizes  the  trajectory  shown 
in  Figure  2.  The  magnitude  of  either  control  variable  in  Figure 
4,  is  not  unacceptably  large.  The  60Hz  component  in  the  excita¬ 
tion  voltage  is  really  a  reflection  of  the  imbalance  in  the 
stator  currents,  not  a  control  effort,  and  this  component  is  seen 
to  be  decaying. 

To  check  for  the  robustness  of  the  controller,  it  was  used  to ^ 
stabilize  the  system  starting  from  a  different  initial  condition 
than  that  was  used  in  constructing  the  original  controller.  The 
results  are  shown  in  Figure  5,  and  it  can  be  seen  that  successful 
control  action  is  obtained. 

The  important  difference  between  this  control  scheme  and  other 
ones  reported  by  researchers  earlier  [6,8,9,10]  is  that  this 
control  action  is  applied  after  the  clearing  of  the  fault  and  not 
immediately  upon  the  detection  of  the  fault.  The  advantage  being 
that  control  action  can  be  applied  only  to  disturbances  for  which 
it  is  known  that  stabilizing  action  will  be  successful.  This 
selective  application  is  crucial,  for  in  a  power  system,  often 
there  are  disturbances  where  it  is  better  to  sacrifice  system 
integrity  and  maintain  system  security  (load  shedding),  than  to 
attempt  to  maintain  system  integrity  and  lose  system  security 
(black-out).  It  is  the  state-dependent  nature  of  the  control 
action  that  permits  this  selectivity  and  the  feedback  application 
results  in  enhanced  transient  stability  margins.  Compared  to 


268 


open-loop  strategies,  the  advantage  of  the  feedback  scheme  is 
that  action  can  be  applied  only  to  those  disturbances  for  which 
the  system  does  not  have  inherent  stability.  Open-loop  schemes 
apply  control  effort  indiscriminately  to  every  disturbance 
whether  the  system  possesses  inherent  stability  or  not.  For  a 
power  system  where  the  effect  of  frequent  stresses  must  be 
considered  on  the  life  cycle  of  very  expensive  equipment  this  is 
an  important  consideration  in  selecting  a  control  strategy. 

CONCLUSION 

The  applicability  of  the  methods  of  linear  optimal  control  to  a 
nonlinear  system  has  been  demonstrated.  The  control  scheme  is 
obtained  by  an  iterative  procedure  and  is  shown  to  both  improve 
stability  margin  and  stabilize  major  disturbances.  A  synthesis 
of  the  power  systems  and  control  systems  approaches  to  the  tran¬ 
sient  stability  problem  was  achieved  by  allowing  the  generator 
and  the  external  power  system  to  have  a  common  set  of  state  vari¬ 
ables.  As  mentioned  previously,  it  might  be  possible  to  apply 
these  same  techniques  to  enhance  the  control  of  both  slewing  and 
position  control  of  five  control  systems. 
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